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ed. The full-length nucleotide sequences of the cDNA 
and amino acid sequences encoded by the nucleotide 
sequences have been determined. Because the cDNA 
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Description 

FIELD OF THE INVENTION 

5 [0001] The present invention relates to polynucleotides encoding novel polypeptides, polypeptides encoded by the 
polynucleotides, and new uses of these. 

BACKGROUND OF THE INVENTION 

w [0002] Currently, the sequencing projects, the determination and analysis of the genomic DNA of various living or- 
ganisms have been in progress all over the world. The whole genomic sequences of more than 40 species of prokary- 
otes, a lower eukaryote, yeast, a multicellular eukaryote, C. elegans, and a higher plants, arabidopsis, etc. are already 
determined. For human genome, presumably having 3 billion base pairs, the analysis was advanced under global 
cooperative organization, and a draft sequence was disclosed in 2001 . Moreover, all the structures are to be clear and 

15 to be disclosed in 2002 - 2003. The aim of the determination of genomic sequence is to reveal the functions of all genes 
and their regulation and to understand living organisms as a network of interactions between genes, proteins, cells or 
individuals through deducing the information in a genome, which is a blueprint of the highly complicated living organ- 
isms. To understand living organisms by utilizing the genomic information from various species is not only important 
as an academic subject, but also socially significant from the viewpoint of industrial application. 

20 [0003] However, determination of genomic sequences itself cannot identify the functions of all genes. For example, 
as for yeast, only the function of approximately half of the 6000 genes, which is predicted based on the genomic 
sequence, was able to be deduced. On the other hand, the human genome has been estimated to contain about 
30,000-40,000 genes. Further, 1 00,000 or more types of mRNAs are said to exist when variants produced by alternative 
splicing are taken into consideration. Therefore, it is desirable to establish "a high throughput analysis system of the 

25 gene functions" which allows us to identify rapidly and efficiently the functions of vast amounts of the genes obtained 
by the genomic sequencing. 

[0004] Many genes in the eukaryotic genome are split by introns into multiple exons. Thus, it is difficult to predict 
correctly the structure of encoded protein solely based on genomic information. In contrast, cDNA, which is produced 
from mRNA that lacks introns, encodes a protein as a single continuous amino acid sequence and allows us to identify 

30 the primary structure of the protein easily. In human cDNA research, to date, more than three million ESTs (Expression 
Sequence Tags) are publicly available, and the ESTs presumably cover not less than 80% of all human genes. 
[0005] The information of ESTs is utilized for analyzing the structure of human genome, or for predicting the exon- 
regions of genomic sequences or their expression profile. However, many human ESTs have been derived from prox- 
imal regions to the 3'-end of cDNA, and information around the 5'-end of mRNA is extremely little. Among human 

35 cDNAs, the number of the corresponding mRNAs whose encoding full-length protein sequences are deduced is ap- 
proximately 13,000. 

[0006] It is possible to identify the transcription start site of mRNA on the genomic sequence based on the 5'-end 
sequence of a full-length cDNA, and to analyze factors involved in the stability of mRNA that is contained in the cDNA, 
or in its regulation of expression at the translation stage. Also, since a full-length cDNA contains atg codon, the trans- 

40 lation start site, in the 5'-region , it can be translated into a protein in a correct frame. Therefore, it is possible to produce 
a large amount of the protein encoded by the cDNA or to analyze biological activity of the expressed protein by utilizing 
an appropriate expression system. Thus, analysis of a full-length cDNA provides valuable information which comple- 
ments the information from genome sequencing. Also, full-length cDNA clones that can be expressed are extremely 
valuable in empirical analysis of gene function and in industrial application. 

45 [0007] Therefore, if a novel human full-length cDNA is isolated, it can be used for developing medicines for diseases 
in which the gene is involved. The protein encoded by the gene can be used as a drug by itself. Thus, it has great 
significance to obtain a full-length cDNA encoding a novel human protein. 

[0008] In particular, human secretory proteins or membrane proteins would be useful by itself as a medicine like 
tissue plasminogen activator (TPA), or as a target of medicines like membrane receptors. In addition, genes for signal 

so transduction-related proteins (protein kinases, etc.), glycoprotein-related proteins, transcription-related proteins, etc. 
are genes whose relationships to human diseases have been elucidated. Moreover, genes for disease -related proteins 
form a gene group rich in genes whose relationships to human diseases have been elucidated. 
[0009] Therefore, it has great significance to isolate novel full-length cDNA clones of human, only few of which has 
been isolated. Especially, isolation of a novel cDNA clone encoding a secretory protein or membrane protein is desired 

55 since the protein itself would be useful as a medicine, and also the clones potentially include a gene involved in diseases. 
In addition, genes encoding proteins that are involved in signal transduction, glycoprotein, transcription, or diseases 
are expected to be useful as target molecules for therapy, or as medicines themselves. These genes form a gene 
group predicted to be strongly involved in diseases. Thus, identification of the full-length cDNA clones encoding those 
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proteins has great significance. 

SUMMARY OF THE INVENTION 

5 [0010] An objective of the present invention is to provide polynucleotides encoding novel polypeptides, polypeptides 
encoded by the polynucleotides, and novel usages of these. 

[0011] The inventors have developed a method for efficiently cloning, from a cDNA library having very high fullness- 
ratio, a human full-length cDNA that is predicted to be a full-length cDNA clone, where the cDNA library Is synthesized 
by an improved method (WO 01/04286) of the oligo-capping method (K. Maruyama and S. Sugano, Gene, 138: 171 -1 74 
10 (1994); Y. Suzuki et al., Gene, 200: 149-156 (1997)). Then, the nucleotide sequences of cDNA clones whose fullness 
ratio is high, obtained by this method, were determined mainly from their 5'-ends, and, if required, from 3'-ends. 
[0012] Further, representative clones, which were estimated to be novel and full-length, among the clones obtained, 
were analyzed for their full-length nucleotide sequences. The determined full-length nucleotide sequences were ana- 
lyzed by BLAST homology search of the databases shown below. Because the homology search of the present inven- 
ts tion is carried out based on the information of full-length cDNAs including the entire coding regions, homology to every 
part of a polypeptide can be analyzed. Thus, in the present invention, the reliability of homology search has been 
greatly improved. 

[1] SwissProt 

20 (http://www.ebi.ac.uk/ebi_docsSwissProt_db/swisshome.html), 
[2] GenBank (http://www.ncbi.nlm.nih.gov/web/GenBank), 
[3] UniGene (Human) (http://www.ncbl.nlm.nih.gov/UniGene), and 

[4] nr (a protein database, which has been constructed by combining data of coding sequences (CDS) in nucleotide 
sequences deposited in GenBank, and data of SwissProt, PDB (http://www.rcsb.org/pdb/index.html), PIR (http:// 
25 pir.georgetown.edu/pirwww/pirhome.shtml), and PRF (http://www.prf.or.jp/en/); overlapping sequences have been 

removed.) 

[0013] Further, the gene expression profiles of cDNA clones whose full-length nucleotide sequence had been deter- 
mined were studied by analyzing the large-scale cDNA database constructed based on the 5'-end nucleotide sequenc- 
30 es of cDNAs obtained. In addition to the analysis for the expression profile by computer, the profiles of gene expression 
in living cells were also determined by PCR. The present inventors revealed the usefulness of the genes of the present 
invention based on these analysis results. 

[0014] In the present invention, gene functions were revealed by the analysis of expression profiles in silico based 
on the information of full-length nucleotide sequences. The expression profiles used in the expression frequency anal- 

35 ysis were studied based on the database containing sufficient amount of fragment sequence data. The expression 
frequency analysis was carried out by referring, for these expression profiles, to the full-length nucleotide sequences 
of many cDNA clones obtained in the present invention. Thus, a highly reliable analysis can be achieved by referring 
to the full-length nucleotide sequences of a wide variety of genes for the sufficiently large population for analysis (ex- 
pression profiles). Namely, the results of expression frequency analysis using the full-length sequences of the present 

40 invention more precisely reflect the gene expression frequency in tissues and cells from which a certain cDNA library 
was derived. In other words, the information of full-length cDNA nucleotide sequence of the present invention made it 
possible to achieve the highly reliable expression frequency analysis. 

[001 5] The full-length cDN A clones of this invention were obtained by the method comprising the steps of [1 ] preparing 
libraries containing cDNAs with the high fullness ratio by oligo-capping, and [2] assembling 5'-end sequences and 

45 selecting one with the highest probability of completeness in length- in the cluster formed (there are many clones longer 
in the 5'-end direction). However, the uses of primers designed based on the 5- and 3'-end sequences of polynucle- 
otides provided by the present invention enable readily obtaining full-length cDNAs without such a special technique. 
The primer, which is designed to be used for obtaining cDNAs capable of being expressed, is not limited to the 5'- and 
3'-end sequences of polynucleotide. 

50 [0016] Specifically, the present invention relates to a polynucleotide selected from the group consisting of the fol- 
lowing (a) to (g): 

(a) a polynucleotide comprising a protein-coding region of the nucleotide sequence of any one of SEQ ID NOs 
shown in Table 1 ; 

55 (b) a polynucleotide encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs shown 

in Table 1; 

(c) a polynucleotide comprising a nucleotide sequence encoding a polypeptide comprising the amino acid se- 
quence of any one of SEQ ID NOs shown in Table 1, wherein, in said amino acid sequence, one or more amino 
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acids have been substituted, deleted, inserted, and/or added, and wherein said nucleotide sequence encodes a 
polypeptide functionally equivalent to a polypeptide comprising the selected amino acid sequence; 

(d) a polynucleotide hybridizing under stringent conditions to a polynucleotide comprising the nucleotide sequence 
of any one of SEQ ID NOs shown in Table 1 , wherein said nucleotide sequence encodes a polypeptide functionally 

5 equivalent to a polypeptide encoded by the selected nucleotide sequence; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a polypeptide 
encoded by the polynucleotide according to any one of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence having at least 70% identity to the nucleotide sequence of 
(a); and 

10 (g) a polynucleotide comprising a nucleotide sequence having at least 90% identity to the nucleotide sequence of 

(a). 

[0017] The present invention also relates to a polypeptide encoded by the above-mentioned polynucleotide or a 
partial peptide thereof, an antibody binding to the polypeptide or the peptide, and a method for immunologically assaying 
15 the polypeptide or the peptide, which comprises the steps of contacting the polypeptide or the peptide with the antibody, 
and observing the binding between the two. 

[0018] Furthermore, the present invention features a vector comprising the above-mentioned polynucleotide, a trans- 
formant carrying the polynucleotide or the vector, a transformant carrying the polynucleotide or the vector in an ex- 
pressible manner, and a method for producing the polypeptide or the peptide, which comprises the steps of culturing 

20 the transformant and recovering an expression product. 

[0019] Another feature of the present invention is an oligonucleotide comprising at least 15 nucleotides, said oligo- 
nucleotide comprising a nucleotide sequence complementary to the nucleotide sequence of any one of SEQ ID NOs: 
1 to 1 970 or to a complementary strand thereof. This oligonucleotide can be used as a primer for synthesizing the 
above-mentioned polynucleotide or used as a probe for detecting the polynucleotide. The present invention includes 

25 an antisense polynucleotide against the polynucleotide or a part thereof, and a method for detecting the polynucleotide, 
which comprises the following steps of: 

a) incubating a target polynucleotide with the oligonucleotide under hybridizable conditions, and 

b) detecting hybridization of the target polynucleotide with the oligonucleotide. 

30 

[0020] Still another feature of the present invention is a database of polynucleotides and/or polypeptides, said da- 
tabase comprising information on at least one of the nucleotide sequences of SEQ ID NOs: 1 to 1970 and/or on at 
least one of the amino acid sequences of SEQ ID NOs: 1 971 to 3940. 

[0021] Herein, "polynucleotide" is defined as a molecule, such as DNA and RNA, in which multiple nucleotides are 

35 polymerized. There are no limitations on the number of the polymerized nucleotides. In case that the polymer contains 
relatively low number of nucleotides, it is also described as an "oligonucleotide", which is included in the "polynucleotide" 
of the present invention. The polynucleotide or the oligonucleotide of the present invention can be a natural orchemically 
synthesized product. Alternatively, it can be synthesized using a template polynucleotide by an enzymatic reaction 
such as PCR. Furthermore, the polynucleotide of the present invention may be modified chemically. Moreover, not only 

40 a single-strand polynucleotide but also a double-strand polynucleotide is included in the present invention. In this 
specification, especially in claims, when the polynucleotide is described merely as "polynucleotide", it means not only 
a single-strand polynucleotide but also a double-strand polynucleotide. When it means double-strand polynucleotide, 
the nucleotide sequence of only one chain is indicated. However, based on the nucleotide sequence of a sense chain, 
the nucleotide sequence of the complementary strand thereof is essentially determined. 

45 [0022] As used herein, an "isolated polynucleotide" is a polynucleotide the structure of which is not identical to that 
of any naturally occurring polynucleotide or to that of any fragment of a naturally occurring genomic polynucleotide 
spanning more than three separate genes. The term therefore includes, for example, (a) a DNA which has the sequence 
of part of a naturally occurring genomic DNA molecule in the genome of the organism in which it naturally occurs; (b) 
a polynucleotide incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner such 

50 that the resulting molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate molecule 
such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PCR), or a restriction 
fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion 
polypeptide. Specifically excluded from this definition are polynucleotides of DNA molecules present in mixtures of 
different (i) DNA molecules, (ii) transfected cells, or (iii) cell clones; e.g., as these occur in a DNA library such as a 

55 cDNA or genomic DNA library. 

[0023] The term "substantially pure" as used herein in reference to a given protein or polypeptide means that the 
protein or polypeptide is substantially free from other biological macromolecules. For example, the substantially pure 
protein or polypeptide is at least 75%, 80%, 85%, 95%, or 99% pure by dry weight. Purity can be measured by any 
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appropriate standard method known in the art, for example, by column chromatography, polyacrylamide gel electro- 
phoresis, or HPLC analysis. 

[0024] All the cDNAs provided by the present invention are full-length cDNAs. The "full-length cDNA" herein means 
that the cDNA contains the ATG codon, which is the start point of translation therein. The untranslated regions upstream 
5 and downstream of the protein-coding region, both of which are naturally contained in natural mRNAs, are not indis- 
pensable. It is preferable that the full-length cDNAs of the present invention contain the stop codon. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 [0025] 

Figure 1 shows the restriction map of the vector pME18SFL3. 
DETAILED DESCRIPTION OF THE INVENTION 

15 

[0026] All the clones (1 970 clones) of the present invention are novel and encode the full-length polypeptides. Further, 
all the clones are cDNAs with the high fullness ratio, which were obtained by oligo-capping method, and also clones 
which are not identical to any of known human mRNAs (namely, novel clones) selected by searching, for the 5*-end 
sequences, mRNA sequences with the annotation of "complete cds" in the GenBank and UniGene databases by using 
20 the BLAST homology search [S. F. Altschul, W. Gish, W. Miller, E. W. Myers & D. J. Lipman, J. Mol. Biol., 215: 403-410 
(1990); W. Gish & D. J. States, Nature Genet., 3: 266-272 (1993)]; they are also clones that were assumed to have 
higher fullness ratio among the members in the cluster formed by assembling. Most of the clones assessed to have 
high fullness ratio in the cluster had the nucleotide sequences longer in the 5*-end direction. 

[0027] All the full-length cDNAs of the present invention can be synthesized by a method such as PCR (Current 
25 protocols in Molecular Biology edit. Ausubel et al. (1987) Publish. John Wiley & Sons Section 6.1-6.4) using primer 
sets designed based on the 5'-end and 3*-end sequences or using primer sets of primers designed based on the 5"- 
end sequences and a primer of oligo dTsequence corresponding to poly A sequence. Table 1 contains the clone names 
of full-length cDNA of 1 970 clones of the present invention, SEQ ID NOs of the full-length nucleotide sequences, CDS 
portions deduced from the full-length nucleotide sequences, and SEQ ID NOs of the translated amino acids. The 
30 positions of CDS are shown according to the rule of "DDBJ/EMBL/GenBank Feature Table Definition" (http://www.ncbi. 
nlm.nih.gov/collab/FT/index.html). The start position number corresponds to the first letter of "ATG" that is the nucle- 
otide triplet encoding methionine; the termination position number corresponds to the third letter of the stop codon. 
These are indicated being flanked with the mark V. However, with respect to the clones having no stop codon, the 
termination position is indicated by the mark V according to the above rule. 

35 



Table 1 





Clone name 


SEQ ID NO. of nucleotide 
sequence 


Position of CDS 


SEQ ID NO. of amino acid 
sequence 


40 


ADRGL20020290 


1 


62.. 1669 


1971 




ADRGL20021910 


2 


150.. 707 


1972 




ADRGL20022600 


3 


464..814 


1973 




ADRGL20023920 


4 


375.. 1853 


1974 




ADRGL20026790 


5 


113..2371 


1975 


45 


ADRGL20027530 


6 


2013..2372 


1976 




ADRGL20036380 


7 


1129..1557 


1977 




ADRGL20036840 


8 


260..889 


1978 




ADRGL20040310 


9 


539.. 844 


1979 


50 


ADRGL20040770 


10 


817.. 1227 


1980 




ADRGL20046760 


11 


1108..1467 


1981 




ADRGL20047080 


12 


823.. 11 34 


1982 




ADRGL20047770 


13 


1532.. 1897 


1983 




ADRGL20057560 


14 


376.. 846 


1984 


55 


ADRGL20059610 


15 


969.. 1961 


1985 




ADRGL20062330 


16 


799..>2117 


1986 




ADRGL20063770 


17 


344.. 664 


1987 



5 



EP 1 308 459 A2 



Table 1 (continued) 



Clone name 


SFO ID NO of nucleotide 


Position of CDS 

1 vO 1 LI w 1 1 Ul VL/O 


CCO \r\ wri nf aminn a/*iH 
OlW ILy INV-V. Ol all III IU aUlU 




sequence 




sequence 


ADRGL20066770 


18 


22. .1416 


1Q88 


ADRGL20067320 


19 


276. .1016 


1989 


ADRGL20079060 


20 


63.. 1748 


1QQ0 


ADRGL20095330 


21 


929 1516 


1QQ1 


ASTR020001910 


22 


1753 !>221fi 

1 / «J»J . . «^tt | O 


1 QQ9 


ASTR020003720 


23 


2089 2586 


1QQ1 


ASTR020004820 


24 


444., 1040 


1QQ4 


ASTR020006530 


25 


2.. 1123 


1QQ5 


ASTR020009140 


26 


344.. 1714 


1QQR 


ASTR0200 10010 


27 


1236 1727 


1QQ7 


ASTR0200102QO 


28 

to 


9 745 
t . . / to 


1QQ8 


ASTR0200 12270 


29 


98R R97 


1QQQ 


ASTR020020240 


30 


19 115 
i £...000 


9000 
tUUU 


ASTR020020350 


31 


1*384 1854 
1 oot., 1 oot 


9001 
tUU I 


ASTR020022020 


32 


4R7 10Q1 

to / . . 1 UjO 


9009 
tout 


ASTR02Q026320 


JO 


t/ O.xlD I 


tUUO 


ASTR020027330 


14 


481 1101 

to 1 . . 1 1 O I 




ASTR020038400 


15 

oo 


59 9095 

Ot . . tUtO 


tUUO 


ASTR090045840 

no 1 PlVJt U UtJOtU 


Ifi 


9R1 1051 
too. . 1 UOl 


tollo 


ASTRG2004R980 


17 

O/ 


1 40 1 4.09 


0007 


ASTR020047510 


18 
oo 


1940 1R11 

I t4U.. ID 1 1 


tUUO 


ASTR0200508 1 0 


1Q 

OJ> 


171 1 RQ4 
1 / 1 . . 1 oyt 


tuuy 


ASTR020052420 

no 1 (IvtU wJtttU 


40 


11RR 91R9 
1 1 OO..tOOt 


001 0 

tU 1 U 


A ST R09 00 5 ^4 10 
no i nuiiuyjoHOU 


41 

t 1 


91 n iqqc 

t 1 O. . 1 OOO 


om 1 
tUl 1 


ASTR0900555 10 

/AO 1 riVLUUJudOU 


49 


90Q R£Q 


001 0 
tin t 


ASTR020055570 
/ao i nutuujjj / o 


41 
to 


949 711 
t*ft.. / Oo 


001 Q 
to 1 0 


ASTR020055Q30 


44 


oto. . 1 uoo 


901 4 
tU 1 


ASTR020058Q60 


45 
to 


55 1915 

OO.. I t IO 


ooi t\ 

tU 1 0 


A^TR0900RQ9no 

/AO 1 nu^UUOS^UU 


4fi 
**o 


71 1 OQT 

/ I ..1 to/ 


001 a 


ASTR0200751 SO 

/AO 1 1 \\JC \J\JI \J I \J\J 


47 
t / 


1004 17Q5 
I out. . 1 / ?JO 


901 7 
tU 1 / 


ASTR09007RRR0 

/AO 1 f IVbUvi uuuu 


48 
to 


15Q4 1QR8 


001 Q 
tUl O 


ASTR090085080 

no 1 I lu£.UwuwvUv 


4Q 
to 


470 9011 


90 1 Q 

tui y 


ASTR020088950 


50 


145 1 510 
oto.. 1 JOU 


J 9090 
tutu 


ASTR020G8Q600 


51 

O 1 


149 1195 
1 ft.. 1 1 to 


9091 
tut I 


ASTR0200QOR80 

/AO 1 1 1U£UuC/V/UOv 


Ot 


1991 *a.9fi11 

I tt I . .>tOO t 


tU^t 


ASTR0200Q11 80 

/AO 1 1 t\J£-\J\J ~ 1 1 DU 


5a 
oo 


19 1914 


tUtO 


ASTR0200Q1770 


54 
ot 


10 118 
1 0. .0 1 0 


909/1 


ASTR0201 41 740 


55 

oo 


qn 147 
00. .ot / 


909R 
tUto 


BGG1 120000670 


58 
oo 


940 R14 


909R 
tUto 


BGG1 12001 0750 


57 
o / 


79 ->9507 

/ t . ,*>tOU / 


9097 
tut/ 


BNGH41 0000570 


58 
oo 


Q57 9097 


909fl 
tUtO 


BNGH420008150 


59 


596.. 1606 


2029 


BNGH420014060 


60 


1072.. 1413 


2030 


BNGH420015760 


61 


92.. 1336 


2031 


BNGH420021680 


62 


147.. 2093 


2032 


BNGH420023870 


63 


301. .1851 


2033 


BNGH420024870 


64 


155.. 1960 


2034 


BNGH420035290 


65 


114..2126 


2035 


BNGH420036410 


66 


1778.. 21 43 


2036 



6 



EP 1 308 459 A2 



Table 1 (continued) 



5 



10 



15 



20 



25 



30 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


BNGH420040760 


67 


698.. 11 62 


2037 ' 


BNGH420042910 


68 


191. .721 


2038 


BNGH420045380 


69 


721.. 1254 


2039 


BNGH420046790 


70 


1079.. 1429 


2040 


BNGH420052350 


71 


787.. 11 46 


2041 


BNGH420059680 


72 


21 3.. 2393 


2042 


BNGH420061350 


73 


600. .3131 


2043 


BNGH420062340 


74 


294,. 641 


2044 


BNGH420070370 


75 


567,. 2429 


2045 


BNGH420074600 


76 


125.. 1696 


2046 


BNGH420075940 


77 


31. .510 


2047 


BNGH42007798 


78 


288..2147 


2048 


BNGH420085100 


79 


243.. 569 


2049 


BNGH420086030 


80 


107..>2556 


2050 


BNGH420087430 


81 


195.. 1835 


2051 


BRACE1 0000510 


82 


642 1703 


2052 


BRACE20003310 


83 


1379 2893 


2053 


BRACE20007330 


84 


339 2015 


2054 


BRACE20009050 


85 


1023 1493 


2055 


BRACE200 14450 


86 


161 1096 


2056 


BRACE20017790 


87 


304.. 639 


2057 


BRACE20018810 


88 


397 1146 

\J \J /..II ^ w 


2058 


BRACE20025820 


89 


453.. 767 


2059 


BRACE20038920 


90 


1122 1463 


2060 


BRACE20050870 


91 


110 1987 




BRACE20051600 


92 


443 895 


2062 


BRACE20051930 


93 


173. .901 


2063 


BRACE20052430 


94 


825 1637 


2064 


BRACE20052530 


95 


98. .490 


2065 


BRACE20054080 


96 


762 1382 


2066 


BRACE20054480 


97 


111. .902 


2067 


BRACE20054600 


98 


392 1276 


2068 


BRACE20055560 


99 


136.. 735 


2069 


BRACE20057870 


100 


1169 1825 


2070 1 


BRACE20059110 


101 


1452. .1910 


2071 


BRACE20059810 


102 


689 2218 


2072 


BRACE20061620 


103 


162.. 11 63 


2073 


BRACE20062580 


104 


1164 1859 


2074 


BRACE20063540 


105 


378.. 1670 


2075 


BRACE20065470 


106 


427.. 11 01 


2076 


BRACE20066360 


107 


233..736 


2077 


BRACE20068710 


108 


1099.. 1440 


2078 


BRACE20069000 


109 


1355.. 2305 


2079 


BRACE20069110 


110 


576.. 91 7 


2080 


BRACE20069440 


111 


278.. 1504 


2081 


BRACE20079200 


112 


928.. 141 3 


2082 


BRACE20079370 


113 


158.. 1522 


2083 I 


BRACE20097540 


114 


1474.. 21 03 


2084 


BRACE20098860 


115 


693..1193 


2085 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO ID NO of amino arid 




sequence 




sequence 


BRACE20099070 


116 


53.. 1441 


2086 


BRACE201 94670 


117 


11. .616 


2087 


BRACE20196180 


118 


35..916 


2088 


BRACE201 96960 


119 


1454.. 1912 


2089 


BRACE20200770 


120 


306. .683 


2090 


BRACE20200970 


121 


426. .764 


2091 


BRACE20204670 


122 


760. .2124 


2092 


BRACE20205840 


123 


40.. 387 


2093 


BRACE20207420 


124 


11 9. .469 


2094 


BRACE20212450 


125 


168 590 


20Q5 

£U9w 


BRACE20215410 


126 


111. .1361 


20QR 


BRACE20216700 


127 


1403 1738 


20Q7 


BRACE202 16950 


128 


911 1315 


20QR 


BRACE202 19360 


129 


198 596 


20QQ 


BRAMY1 0000980 


130 


254..616 


2100 


BRAMY1 0001 730 


131 


796 1158 


21 01 


BRAMY20000210 


132 


134. .445 


91 H9 
c. I U£ 


BRAMY20000250 


133 


190 1932 


210^ 

£ 1 vO 


BRAMY20001510 


134 


129. .917 


£ 1 u*+ 


BRAMY20003540 


135 


144 2477 


21 


BRAMY20003880 


136 


1 91 808 


21 or 


BRAMY20005080 


137 


1 fi?R 1Q5ft 


91 07 


BRAMY20013670 


138 


551 2ftfi1 

JJ 1 ..COO 1 


91 nn 


BRAMY20016780 


139 


£ t O. . 1 tJOsJ 


91 no 

£ 1 Ua 


BRAMY20020440 


140 


OJ?. . UOJ 


91 1 n 

cl IU 


BRAMY20021580 


141 


67. .555 


9111 


BRAMY20023390 


142 


1 JuO, . 1 \J\J\j 


2112 

£ 1 1 £ 


BRAMY20023640 


143 


1RR4 22fl0 


91 1 

£.110 


B RAM Y20 024790 


144 


07R fipfi 
C- I U.,U£.U 


9114 


BRAMY20027390 


145 


420.. 782 


911 R 

£ 1 1 0 


BRAMY20027990 


146 


52Q 1572 


211ft 


BRAMY20028530 


147 


Q7Q 1 97ft 
cJ / O. . 1 £/ O 


2117 
£11/ 


BRAMY20028620 


148 


1048.. 1434 


2118 


BRAMY20035380 


149 


925 1707 


211Q 


BRAMY20035830 


150 


21 9. .977 


2120 

£ 1 £U 


BRAMY20036530 


151 


1411. .1761 


21 21 

£ 1 £ 1 


BRAMY20036810 


152 


321. .644 


21 22 

£ 1 ££ 


BRAMY20038980 


153 


715 >2057 


21 2^ 

£ 1 £w 


BRAMY20039290 


154 


81. .1043 


2124 


BRAMY20040580 


155 


374 769 


21 2R 

£ 1 £3 


BRAMY20043520 


156 


942.. 1778 


21 2fi 

£ I £0 


BRAMY20043630 


157 


25.. 1119 


2127 


BRAMY20044920 


158 


40.. 1947 


2128 


BRAMY20045210 


159 


367..750 


2129 


BRAMY20045420 


160 


4..888 


2130 


BRAMY20047560 


161 


220..726 


2131 


BRAMY20050640 


162 


1802.. 2635 


2132 


BRAMY20050940 


163 


23.. 385 


2133 


BRAMY20051820 


164 


1411..2157 


2134 
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Table 1 (continued) 



V-/I \j\ ic 1 1 cii i ic 


SFO ID NO csf nnrlAntirfp 

OUm 1 LJ INVJ, Ul 1 1 UL-IUULlUt; 


rUOIllUII vJI OUO 


uCU IU INVJ. ot dmino aulU 




sequence 




cpn 1 ipnpp 


B RAM Y20052440 


1 uo 


?Q 44R 


9I qc; 


BRAMY20053910 


166 


11 ..358 


91 QR 
c. I OO 


BRAMY20055760 

L-/ J 1*11 » 1 1 ^ Www / W 


167 


664 99^^ 


91 ^7 


BRAMY20056620 


168 


4fi 79R 


91 Qfl 

^1 JO 


BRAMY20056840 

Ul I'll VI | bUU^UUtU 


169 


40 1 ^Q9 


91 QQ 

^1 oy 


BRAMY20063750 


170 
i / \j 


cqc 91 rr 


91/in 

c. 1 4U 


BRAMY20072440 

i— ^ i ini»i i luu( b f I v 


171 


R9Q 1 fi9A 


91 A1 


BRAMY20072870 


179 
1 / c 


4Q7 qio 


91 AO 


BRAMY2G073080 


173 
\ / o 


o. ,ojj 


91 Aq 
<i i ^o 


RRAMY200741 1 0 

LJ 1 lr\IVI I lUU / HI IV 


174 
i /t 


q-i>l pop 
O I *f . .ooo 


^144 


RRAMY90074RR0 

□ riAIVI 1 C\J\J 1 HOOV 


1 7R 
I / o 


P 1 OO.. 1 Ott 


91 /IK 


RRAMY9007R1 00 
lj i mivi i i \j i uy 


1 7R 


A^Q 101Q 

*foy. . I u i y 


^ 1 40 


RRAMY9007R1 30 

LJ 1 irAIVI I £UU/ U 1 OV/ 


1 77 
I / / 


CQ 070 

oy ..of o 


91/17 


R RAM Y9007fi c ,30 


1 7ft 
I / o 




91/1Q 


B RAM Y90083330 


17Q 


10 AQ9 


9140 

c. \ 4y 


RRAMY900R3R90 


1 R0 


R9^ 


91 

£ IOU 


RRAMY900RQ770 


1 R1 

1 O 1 


i /o.. yoo 


t IOI 


RRAMY900Q1 930 


1 R9 


OH Q 1 C"7Q 

o io..io/y 




RRAMY900Q34Q0 

DnnlVI I tUUv>OH5JU 


1 R3 
I OO 


O0/..IU0U 


Ol CQ 


RRAMY900Q4RQ0 

LJ llMIVI I tyU3403y 


1 RA 


Qrt QH QQ 
0\J..£. 1 OO 


01 C>1 


RRAMY900QR0R0 


1 R^ 




CC 

^loo 


RRAMY900QRR70 
d riMivi i ^uuyoo / u 


1 Rfi 


i no Qn7 


OH CC 
/il OO 


RRAMY900QRQ30 
Ljn/Aivi t tuyyoyou 


1 R7 
I Of 


h qq-4 i onn 
1 001 ..1 oUU 


OH C*7 
ill O/ 


RRAIMV901 OORRO 
DriArvl T /1U 1 UUOOU 


I OO 


^oo..o/U 


OH CO 
^lOO 


RRAMY901 09Q00 
lj riMivi t ) y^yyy 


1 RQ 

1 Oo 


9nn 7ftn 

ZUU. . / ou 


OH CQ 


RRAMY901 07QRO 
d nrtivi i i u / you 


1 on 
i yu 


o4o..ooy 


oh cn 
^1 OU 


RRAMY901117RO 
unnivi T c.\j ill/ ou 


1 Q1 
i a i 




OH ft1 
/ll Ol 


RRAMY901 1 7R70 

D 11 AVIV I T c\> \ 1 / O / U 


1 Q9 


ftQ 7QO 
Oo.. /O^ 


OH CO 
<ll O/l 


RRAMY901 1 R/11 0 
DrlMWI T C\i 1 1 OM- 1 U 


1 OQ 


O/l TOO 


OH CO 

21 00 


RRAMY901 1 R4Q0 
D ii/AIVi T I 1 0*tyu 


1 O/l 

i y^ 


QO 1Ck\ 


OH C>l 

21 o4 


RRAMY901 901 70 


1 OR 

i yo 


H H H /ICQ 


OH CC 
21 OO 


RRAMY901 93ziOO 

LJ 11 Ml VI T £.\J 1 cO*fUU 


1 QR 

i yo 


H 7cc 


oh ee 
21 Ob 


RRAMY901 94Q70 


1 Q7 

1 y / 


R9^ 1 99R 


21 0/ 


RRAMY901 9R1 70 


1 on 
i yo 


•\ K7 C70 

1 0/ ..o/y 


OH CO 

2lbB 


RRAMY901 9KqfiO 

DriMIVI T <iU I iiOODVJ 


1 QQ 

i yy 


1 J4.. 1 UoU 


OH CO 

21 by 


RRAMY901 9R5R0 


9on 


90 1 7/1 7 

^y..i /4/ 


OH "7H 

2l /U 


RRAMY901 9RQ1 0 


901 


11/1 CH Q 
1 1 4. .Ol O 


OH 7i 
21 /l 


RRAMY901 97310 
lj rir\ivi i cy ic/o iu 


909 
CMC. 


1 QRR 90Qfi 

i yoo..^ooo 


OH 70 

21/2 


RRAMY201 977R0 


90^ 


^17 RQ1 


91 7Q 
21/0 


RRAMY901 340R0 


904 


1 OO ^99 


91 7/1 

21/4 


RRAMY901 3R790 


90R 


t;7 Am 

O/ ..4U1 


91 7C 

21 /o 


BRAMY201 37360 


206 


976..2193 


2176 


B RAM Y20 139440 


207 


2.. 1597 


2177 


BRAMY201 39750 


208 


88.. 435 


2178 


BRAMY201 43870 


209 


1419..2102 


2179 


BRAMY201 52510 


210 


296.. 1993 


2180 


BRAMY201 55500 


211 


722.. 1069 


2181 


BRAMY201 58550 


212 


142.. 951 


2182 


BRAMY201 59250 


213 


286.. 810 


2183 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO in NO of amino arid 




sequence 




sequence 


B RAM Y20 160020 


214 


143. .919 


21 84 


BRAMY201 73480 


215 


1544.. 1906 


2185 


BRAMY201 90550 


216 


11 4.. 1640 


2186 


BRAMY201 94680 


217 


1179 1517 

1 1 / \J * . 1 w 1 / 


2187 


BRAMY20204270 


218 


218. .844 


2188 


BRAMY20206340 


219 


237 180*5 

bW # •■ 1 U Uv 


C \ 0\7 


BRAMY20219620 


220 


1014 1670 


P1 QO 

L 1 90 


BRAMY20221 600 


221 


168 QQ? 


P1 Q1 


BRAMY20223010 


222 


466 1257 

^U U ■ . 1 LU f 


P1 QP 


BRAMY20225250 


223 


qfift f?7Q 
ouu. ,u / o 


P1 


BRAMY20225320 


224 


P41 840 

4tH 1 . . U4U 


P1 Q4 

l I 5?H 


B RAM Y20227230 


225 


1 *3P7 P0R7 

1 JL / . XVO / 


P1 QR 


BRAMY20227860 

LJI 1*11 VI | LULL / UUU 


226 


1 0 R*i7 


OA Qfi 


BRAMYP02P7Q60 

unnivi I lull / ouu 


PP7 

LL / 


1 1 4ft 1 RRft 
I I HO.. I ooo 


OA Q7 


BRAMY20231150 

LJI l/l I V 1 | LULU | | U U 


228 


1 QQ 114R 


OA Qfl 

^ 1 yo 


B RAM Y20234820 

i inivi i lvLvtulv 


229 


1QQ P07Q 


OA QQ 

*l 1 yy 


BRAMY202371 QO 

LJ 1 1/^1 VI | LUlJ / 1 \J\J 


P^O 


9Q7 R70 

Oar ..O / \J 




BRAMY20P38630 

LJ 1 l/^IVI I CvtvOUuU 


P^1 
lO 1 


OUO. . 1 OO 1 




BRAMY20P431 20 

L>l 1/llVI 1 LUb^O ICV/ 


P^P 


i7cc P451 


OOCiO 


B RAM Y202444Q0 


233 


OC7 QQO 


oocvx 


BRAMY20245140 


234 


1PQR 

o. . i ljjo 


oonA 


BRAMY20245350 


P15 


Q4 7RO 

OH. . / OO 




BRAMYP0P4 f v7fi0 

LJ 1 1/AIVI I tyttd/ ou 




oc 1 07c 




BRAMYP0P51 P1 0 

LJI 1/1 1 VI 1 LVkJ IC 1 V 


PT7 


00. ,yoo 


Qon7 


RRAMYPOP^I 7^0 

LJ ll /A 1 VI I lUlj 1 / uU 




Q7 PAR 

y / . .oho 


OOOQ 

tttiVo 


RRAMYPOPfi^OOO 

LJ 1 lAAIVI I lOlOOOOO 


P^Q 


pir 1 

^ ID.. I OOO 




BRAMY20267780 

LJI InlVI 1 LVLU/ / uu 


P40 


1 1 Q4 1 70R 

1 1 vJH.. 1 / UO 


00 \ n 


RRAMYP0PRQ040 


P41 
£h 1 


QQQ 

yoy..^uv 1 




RRAMYP0P71 1 40 
o nrMvi I lUl i i \ ho 


P4P 


1 £R9 OOIQ. 
1 0D£..££OO 


001 0 


RRAMYP0P7451 0 

LJ ll /A 1 VI 1 l Ul / HO 1 O 


P4A 


17RR 01 QQ 


OOi O 
<lt lo 


B RAM YP0P85650 

1 l/l IV 1 1 LULUJUvIv 


P44 


lO. OOt 


LL 1 H 


BRAMY20P874.nO 


P45 






BRAWH2001 4590 

LJ 1 1*1 V ¥ 1 IL.VV 1 ""TW CJ U 


246 


IOC QCfi 

1 LJ..OJI) 


OO A R 


BRAWHP0020470 

1— J 1 l/l V ¥ 1 ICl/V/tUt/ l/ 


P47 


Hon -S.P190 

1 O 1 1 OO 


-1 7 
C.C. 1 / 


B RAWHP0020600 

1— * 1 1/1 ¥ ¥ 1 ILUUL.UUUU 


P4ft 


40P 7PP 


0 


BRAWH20021Q10 

1 1/1 V ¥ 1 ILUUL 1 U 1 U 


P4Q 


qq4 iftn^ 

Oo4.. 1 OOO 


1 y 


BRAWH20025490 

LJ 1 1/1 V ¥ 1 ILUULUT JU 


P^O 

CSV 


irqq pinfi 




BRAWH2002601 0 

LJ 1 1/1 V ¥ 1 ILUvcUU I U 


P51 


Q07 P0^4 
OO / . .tOOH 


OOO A 


BRAWH20027250 


252 


Q4P 14QQ 

U1L , . 1 


0000 


BRAWH20030000 

LJ 1 1/1 ¥ ¥ 1 ILUUUUUUU 




Ofl1 Ipftfi 

OO 1 .. 1 LOO 




B RAWH20039640 

LJ 1 l/l ¥ ¥ 1 ILvUUUUtU 


PR4 
tat 


10Q «>PPfl1 


OOOA 


BRAWH20040680 


255 


201. .2291 


2225 


BRAWH20047790 


256 


290..631 


2226 


BRAWH20050740 


257 


512..>1907 


2227 


BRAWH20055240 


258 


1339.. 1653 


2228 


BRAWH20055330 


259 


1507.. 1911 


2229 


BRAWH20055780 


260 


354.. 953 


2230 


BRAWH20058120 


261 


896.. 1501 


2231 


BRAWH20063010 


262 


1 839..2579 


2232 
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Table 1 (continued) 



10 



15 



20 



25 



30 



35 



Clone name 




Pncitinn nf 


otu \u w\j. 01 arnino acia 




sequence 




<sen 1 ipnfA 

OC^ UCI I L/C 


BRAWH20078080 


263 


1 R1 745 


ooqq 


BRAWH20078620 


264 


qq /TO 


e^O*fr 


BRAWH20080580 


265 


35 1000 

wJ. . 1 UvU 


99*35 
eeOO 


BRAWH20082550 


266 


91 Q1 9R70 

e I JJ 1 . .eU f U 


£41 OO 


BRAWH20082920 


267 


157Q 9194 
I o / a. .e I e*t 


OOQ7 
<£eo/ 


BRAWH20093040 


268 


1 909 1 R55 
1 eve. . 1 OOO 




BRAWH20093070 


269 


4RQ 1 5RQ 

fUl7. . I JQ9 




BRAWH20094900 


270 


719 91RR 

/ 1 e. .e 1 DO 


9940 


BRAWH20095900 


271 


247 91 7R 

tTl ..el / O 


OOAi 


BRAWH20 173790 

i— » i lnin lev i t \j i <j kj 


272 

e / e 


5*34 1 4.9ft 
D04.. 1 *+eo 


OOAO 
d.cAd. 


BRAWH201 74330 

Ul lrVYTI lev 1 / tJOU 


973 

e/ O 


95Q.9 9.979 
eOoe. .Oe/ e 


^4o 


BRAWH201 75230 

UI 1/V V ¥ 1 1 / VeVV 


974 


I J I y.. I401 




BRAWH2Q1 75340 

LJ i mv ¥ 1 lev 1 / JJHU 


975 


O0..OUH 


e^40 


BRAWH201 76850 

i— ' i inf 1 1 lev i / u uu v 


91 'fi 
e / O 


99 9909. 
Oe. .eeUO 




BRAWH201 82670 

u i i#\ » w i iev i ve v # v 


977 

e/ / 


e/O 1 ..OUO? 




B RAWH20 1 83 1 70 
i— * f i/\v v i iev i oo i / v 


97R 
CI o 


i ud. ,y uy 


OO /I Q 


BRAWH201 85260 

y—f \ iniii lev i u J£.uu 


97Q 

e/ i7 


904 1 QAfi 

eu4.. i y4o 


^4y 


RRAWH201 R5970 

uri/\VVI lev 1 O Oe / v 


9RO 
eOv 


l o.,ooy 




BRAWH201 RR01 0 
ui invvi lev i oov i v 


9R1 
ev I 


ood.. i ooy 


OOC4 

£2o 1 


BRAWH201 RR750 
ui lnvii lev 1 00/ ju 


9R9 

eOe 


el .. 0£.*¥ 


OO CO 


BRAWH201 Q0530 

unn¥ » l lev i C7UJOU 


9ft^ 
eOO 


*f 1 U.. 1 U^*f 




RRAWH201 Q0550 




OO.. lOoo 


2254 


RRAWH201 Q1 QRD 
unnvv i lev i xj \ sou 


9ftK. 
eOO 


14^0..^ 1 


2255 


RRHAN1 00007R0 
u riunl \ i uuuu/ Ov 


too 


o4o..looo 


2256 


RRCAN 1 0001 050 
Dnv^niM 1 uuu 1 vQU 


9B7 
eo/ 


QQQ QOQ 


2257 


RRHAN 1 0001 RR0 


9flft 
eoD 


CHQ QQQ 

o i y..yyo 


2258 


RRf:AN200014R0 


9RQ 

eoy 


11/1 >4i4Q 


2259 


RRCAM900041 R0 


OQC\ 
el?U 


O..Jol 


2260 


RRPAN9000 C .9Q.O 

DrlOrMNeUUUOeOU 


ea 1 


bv..byu 


2261 


RRCAN9000541 0 


9Q9 
eoe 


CO -4 QQC 
Oe . . 1 OOO 


2262 


RRCOC1 0000400 


eaO 


1 1 ..OOD4 


2263 


BRCOC90000470 


9Q4 


OQO 1QC4 

etJO.. I OO 1 


22 64 


BRCOC20003600 


9Q5 
eao 


icq I/ICR 
1 DO . . 1 *l-00 


22 DO 


BRHIP1 0000790 

uni III lUvVv/eU 


eyo 




2200 


BRHIP1 0001 040 
uni 1 1 1 i wv i vh-v 


9Q7 
ea / 


/ 0.. I O I / 


OOC7 

22b/ 


BRHIP20000210 

uni 1 1 1 e v vv ve i v 


9QR 


/ ..O/o 


22bo 


BRH 1 P200035Q0 
uni i i FevvUOvJ7v 


9QQ 


D I O. . 1 eOO 


2269 


BRHIP20005060 

usi 11 lit evvvwvv 


^00 


7^fi 1 1 7ft 
/OO.. 1 1 fo 


22 /U 


BRSSN20001970 


301 


OS. .OeO 


0071 
22/1 


BRSSN20005610 


^09 
Ove 


1 74 ^.9^7^ 
1 / 4. .>eO/ O 


00 70 
22/2 


BRSSN20005660 


OUO 


eUoy. ,>^ooo 


0070 
22 /o 


BRSSN20066440 


304 


248.. 1474 


2274 


BRSSN20074640 


305 


257.. 985 


2275 


BRSSN20091190 


306 


121 4.. 2002 


2276 


BRSSN20092440 


307 


4..396 


2277 


BRSSN20093890 


308 


94.. 71 7 


2278 


CD34C20001750 


309 


10..840 


2279 


CT0NG1 0000090 


310 


2551. .2991 


2280 


CTONG20000340 


311 


856.. 1929 


2281 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEO ID NO of amino acid 




sequence 




sequence 


CTONG20002790 


312 


373..70~8 


2282 


CTONG20004110 


313 


74.. 3079 


2283 


CTONG20004520 


314 


146. .772 


2284 


CTONG20007660 


315 


192. .1448 


2285 


CTONG20008190 


316 


1313. .1789 


2286 


CTONG200084 


317 


1034 1849 


2287 


CTONG20015240 


318 


134.. 874 


2288 


CTONG200 17490 


319 


346.. 2235 


2289 


CTONG20020660 


320 


21 9. .635 


2290 


CTONG20020950 


321 


24..3257 


PPQ1 


CTON G20027660 


322 


103.. 942 


PPQP 


CTON G20029030 


323 


102.. 2492 


2293 


CTON G20030280 


324 


219 2891 


2294 


CTONG20031150 


325 


2175 2546 


2295 


CTONG20031890 


326 


8.. 1705 


2296 


CTON G20032930 


327 


352 3102 


PPQ7 


CTONG20033500 


328 


1683 2018 


PPQ8 


CTONG20033610 


329 


404 >3P03 


PPQQ 


CTONG20033750 


330 


Q5 1 QQQ 




CTONG20035240 


331 


234 2687 

tut • . tUO / 


p^ni 


CTONG20036800 


332 


432.. 770 




CTONG200369Q0 


333 


1568 P^47 






^14 


114 47n 




CTONGP0041 1 50 


^5 


406 1 1 64 

HUD, . 1 1 0*+ 




CTONGP0041 P60 


T36 


477 PP76 




CT0NG20042640 


T37 


6 2*381 


p^n7 


CTONG20044230 


338 


11QQ 3P05 


P^D8 


CTON GP0044870 




1 7 P554 

It.. COOH 




CTON GP0045500 


140 


ceo.. \ you 


9*^1 n 

1 u 


CTON GP00466Q0 


*341 


OOJ. ,OHU 


p*31 1 


CT0NG2O 04Q480 


342 


P16 ^>^P68 


p^1 p 


CTON G200504Q0 


343 


4P8 S5Q 


p^1 ^ 

zlo 1 o 


CTON G20051 100 


344 


48. .419 


2314 


CTONGP0051 450 


345 




P^1 5 


CTONG20052780 


346 


Q76 1656 


P^1 6 


CTONG20053990 


347 


PP33 >PQ55 


231 7 


CTONG20055670 


348 


P306 P6P0 


P^1 8 
CO 1 o 


CTONG20055850 

1 I \+ACm, Www V 


349 


506 1246 


P^1 Q 

CO 1 J7 


CTONG20056150 


350 


Q5 1150 

■J ..II 


2320 


CTONG20057750 


351 


P355 P84Q 


P^P1 

COC 1 


CTONG20057950 


352 


3061 3420 


p^pp 

cocc 


CTONG20059130 


353 


109..2613 


2323 


CTONG20060040 


354 


204.. 2630 


2324 


CTONG20061290 


355 


226. .819 


2325 


CTONG20062730 


356 


307..687 


2326 


CTONG20063770 


357 


108..3203 


2327 


CTONG20063930 


358 


250.. 2700 


2328 


CTONG20065240 


359 


1983.. 2333 


2329 


CTONG20065680 


360 


2369.. 2797 


2330 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO of nucleotide 


Position of CDS 


SEQ ID NO of amino acid 




sequence 




sequence 


CTONG20066110 


361 


122.. 1972 


2331 


CTONG20068360 


362 


11 36.. 1876 


2332 


CTONG20069320 


363 


843.. 1238 


2333 


CTONG20069420 


364 


130.. 600 


2334 


CTONG20070090 


365 


561. .2960 


2335 


CT0NG2007072 


366 


431 ..2677 


2336 


CT0NG20070780 


367 


2. .2896 


2337 


CTONG20070910 


368 


87. .1397 


2338 


CTONG20071040 


369 


16.. 1551 


2339 


CTONG20071 680 


370 


189 >2419 


2340 


CTONG2007P930 


371 


193 2643 


2341 


CTONG20073Q C )0 


372 


749.. 2428 


2342 


CTONGP0074000 


373 


81 3185 


2343 


CTONGP00741 70 


374 


153. .1211 


2344 


CTONG20074740 


375 


2754 >3085 


2345 


CTONGP0076P30 


376 


P1QP P560 


2346 


CTONGP007681 0 

W 1 V— ' 1 N \ j£.00 / OO 1 O 


377 


Q0Q P40P 


2347 


CTONGP00777R0 


378 


1517 P170 

1 O 1 / . . I/O 


2348 


CTONGP0 078340 


379 


100 PPPQ 

1 OO. . C-C-C— 


2349 


CTONGP007Q5Q0 


380 


14Q 10RR 

1 *tC7 . , 1 OOO 


2350 


CTONGP00801 40 


381 


159 686 


2351 


CTONGP0081 840 


38P 

out 


58R 8Q7 

OOO. .Ov7 f 


2352 




383 
ooo 


icq i qpc 


P353 


OTONGPona3Q80 


384 

Out 


1 OR 1 440 


2354 


OTONGP00840P0 

V-> 1 WIN V_I*lOUOHO£.0 


385 


301 fififi 
OO 1 . .ooo 


P355 


CTONGP0084RR0 


386 


21 7. .753 


P35R 


CTONGP00R5P1 0 


387 


7RQ 1878 

/ U v7 . . I O / O 


2357 


fTTONGPOl 337P0 


388 
ooo 


15 350 
1 o. .ooo 


P358 


OTONGPOI R55Q0 


38Q 

009 


1 7P1 P044 


P35Q 


OTONGP01 R5750 


3Q0 

Ov/O 


P1R 1Q55 

*1 1 O. . 1 9JJ 


2360 


OTONGP01 RR580 


391 


3P0 161P 

ObO. . I O I c 


2361 


CTONGP01 R7750 

W 1 WIN \J*LO 1 U / / JU 


3QP 


159. .461 


P36P 


CTONGP01 R8P40 


3Q3 

OC/O 


4Q1 856 


2363 


fTTONGPOl R84R0 


3Q4 


5RQ 871 


P3R4 
tout 


CTONGP01 6Q040 


395 


139 8Q4 


2365 


OTONGP01 RQ530 


3QR 


1076 13QQ 


2366 


CTONGP01 70940 


397 


184 1569 


2367 


OTONGP01 74PQ0 


3Q8 

OC70 


or 18P6 

. 1 UL.U 


2368 


CTONGP01 74440 


399 


1246 1623 


2369 


CTONGP01 74580 


400 


1 ..1023 


2370 


CTONGP01 76040 


401 


147 737 


2371 


CTONG201 79390 


402 


1423.. 1881 


2372 


CTONG201 79890 


403 


335. .2344 


2373 


CTONG201 79980 


404 


1821. .2210 


2374 


CTONG201 80620 


405 


1556. .1915 


2375 


CTONG201 80690 


406 


366.. 1442 


2376 


CTONG20181350 


407 


11 67.. 1607 


2377 


CTONG201 83430 


408 


319..2706 


2378 


CTONG201 83830 


409 


393.. 2687 


2379 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


CTONG201 84130 


410 


1970.. 2536 


2380 


CTONG201 84830 


411 


323.. 1204 


2381 


CTONG201 86140 


412 


21. .443 


2382 


CTONG201 86290 


413 


2170..2811 


2383 


CTONG201 86370 


414 


156.. 1244 


2384 


CTONG201 86520 


415 


211. .21 36 


2385 


CTONG201 86550 


416 


260..811 


2386 


CTONG201 88080 


417 


207.. 2774 


2387 


CTONG201 89000 


418 


174.. 1682 


2388 


CTONG201 90290 


419 


220.. 2205 


2389 


CTONG201 90630 


420 


34.. 2082 


2390 


DFNES20016470 


421 


288.. 851 


2391 


DFNES20018000 


422 


40 1596 


2392 


DFNES20025500 


423 


766.. 11 34 


2393 


DFNES20028170 


424 


289.. 1734 


2394 


DFNES20029660 


425 


356.. 2443 


2395 


D FN ES20 032550 


426 


35.. 931 


2396 


DFNES20043710 


427 


108 929 


2397 


DFNES20046840 


428 


786 1310 


2398 


DFNES20055400 


429 


343.. 1584 


2399 


DFNES20057660 


430 


25.. 795 


2400 


DFNES20063460 


431 


26.. 406 


2401 


DFNES20072990 


432 


62.. 11 56 


2402 


DFNES20073320 


433 


377 1735 


2403 


DFNES20076340 


434 


167 898 


2404 


DFNES20080880 


435 


11 ..1669 


2405 


DFNES20088810 


436 


171.. 548 


2406 


DFNES20094820 


437 


107 1807 




FCBBF1 0000230 


438 


104.. 3247 




FCBBF1 0002200 


439 


480.. 782 


2409 


FCBBF1 0004760 


440 


578 1978 


2410 


FCBBF20018680 


441 


177.. 1724 


2411 


FCBBF20020440 


442 


382.. 885 


2412 


FCBBF20021110 


443 


158. .517 


2413 


FCBBF20023490 


444 


64.. 1779 


2414 


FCBBF20028980 


445 


694.. 1014 


2415 


FCBBF20029280 


446 


1680 2021 


2416 


FCBBF20032930 


447 


90.. 452 


2417 


FCBBF20033360 


448 


31 5.. 2369 


2418 


FCBBF20035430 


449 


196.. 726 


2419 


FCBBF20035490 


450 


131. .1387 


2420 


FCBBF20036360 


451 


16.. 366 


2421 


FCBBF20038230 


452 


1468.. 1908 


2422 


FCBBF20038950 


453 


648..992 


2423 


FCBBF20041380 


454 


612..2174 


2424 


FCBBF20043730 


455 


45..>2063 


2425 


FCBBF20054390 


456 


1367.. 1756 


2426 


FCBBF20056580 


457 


82..>2394 


2427 


FCBBF20059660 


458 


672.. 1226 


2428 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


FCBBF20061310 


459 


1582.. 1980 


2429 


FCBBF20066340 


460 ! 


116. .1024 


2430 


FCBBF20070800 


461 


968.. 1447 


2431 


FCBBF20070950 


462 


50..>2299 


2432 


FCBBF30000010 


463 


74.. 643 


2433 


FCBBF30001 020 


464 


16.. 348 


2434 


FCBBF30001100 


465 


341.. >3 125 


2435 


FCBBF30001150 


466 


1209.. 1631 


2436 


FCBBF30002270 


467 


31 9. .927 


2437 


FCBBF30002280 


468 


113 4036 


2438 


FCBBF30002330 


469 


6. .611 


2439 


FCBBF30003610 


470 


1015 2019 


2440 


FCBBF30004340 


471 


120 1013 


2441 


FCBBF30004730 


472 


368 1027 


2442 


FCBBF30005180 


473 


155 2734 


2443 


FCBBF30005360 


474 


29.. 2710 


2444 


FCBBF30005500 


475 


251.. 2494 


2445 


FCBBF30019140 


476 


40.. 2979 


2446 


FCBBF30019180 

1 *w" l_J l_J I uW 1 w 1 W 


477 


140 IfifiQ 

1 tU . . 1 UUw 


2447 


FCBBF300192 


478 


1214 2269 

1 t— 1 "T. ■ WW 


2448 


FCBBF30021900 

1 >l_> 1 \J\J\JC 1 w WW 


479 


63 1973 

UJ . . 1 w / w 


2449 






1^08 2480 

1 JUO..l'+OU 


2450 


FCRRF^OOPfi^RO 


4R1 
to 1 


170 2725 


24R1 


FCRRF*3002Q2R0 


48? 
tot 


c;o 40ftfi 

JL . ,tuoo 


2452 


FCBRP30035570 


483 


217 !>?4R8 


2453 


FCBBF30042610 


484 


33 1244 


2454 


FCRRF30048420 


485 


144. .1094 


2455 


FC RRF^OO^moO 


486 


62.. 21 82 


2456 


FCRRF^OORfiQfiO 


487 


10Q8 141 R 

1 . , l*t u 


2457 


FCRRF300624Q0 


488 


128 1063 


2458 


FORRF^OOfilQQO 


489 


250 828 


2459 


FCBBF30068210 


490 


51 .2762 


2460 


FCBBF30071500 


491 


227 >2898 


2461 


FCBBF30072440 


492 


2485 2865 


2462 


FCBBF30072480 

1 X— 'UUI UWV/( £."TUW 


493 


2602 >3057 

L.UWL-. . W W W # 


2463 


FCRRF30074530 

■ V/UUI WWW f "www 


494 


1031 1432 

1 WW 1 . . 1 tvJt. 


2464 


FCBBF30074620 


495 


840 1316 

W^ W ■ . 1 \J t u 


2465 


FCBBF30075970 

i ^LJLJI W W wf ^J<S 1 W 


496 


146.. 460 


2466 


FCBBF30076310 


497 


42.. 1007 


2467 


FCBBF30078600 


498 


113 1399 

1 I W . . 1 www 


2468 


FCBBF30079770 

1 wUUI www/ w / / w 


499 


6Q3 2777 

uww. .a i i 


2469 


FCBBF30080730 


500 


54.. 467 


2470 


FCBBF30081000 


501 


526.. 924 


2471 


FCBBF30085560 


502 


60.. 1919 


2472 


FCBBF30088700 


503 


39..>3015 


2473 


FCBBF30089380 


504 


8..2701 


2474 


FCBBF30091010 


505 


172..>3465 


2475 


FCBBF30091520 


506 


56.. 2284 


2476 


FCBBF30093170 


507 


974.. 1528 


2477 
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Table 1 (continued) 



Clone name 


SFO ID NO of nucleotide 
sequence 


Position of CDS 


cpo in MO of amino aHri 

sequence 


FCBBF30095410 


508 


64.. 1002 


P47R 

4.H / O 


FCBBF30099490 


509 


2939.. 3256 


2479 


FCBBF301 00080 


510 


1237.. 1656 


2480 


FCBBF30100120 ! 


511 


8.. 2305 


2481 


FCBBF301 00410 


512 


121. .1374 


94 R9 


FCBBF30101240 


513 


16Q6 P3RP 


94fl*3 


FCBBF30101300 

1 WUUI WW 1 W 1 WW 


514 


3R0P ^441*3 

wwwfc..^ fc r*T 1 w 


94 R4 

t4o4 


FCBBF301 05080 


515 


Q73 1836 

W f w. p 1 www 


94R5 


FCBBF30 105440 

1 wUUI WW 1 VW T T W 


516 


1354 P040 


94RR 


FCBBF301 05860 

1 wL/UI WW I WWUUW 


517 


355 >P5P4 


94R7 


FCBBF301 06950 

1 vUUI WW 1 WWWWW 


518 


192. .944 


94RR 


FCBBF301 07290 


519 


100P 1610 


P4RQ 


FCBBF301 07330 

f WDUIiJU I v / www 


5P0 


70*3 1071 


9400 


FCBBF30114180 

1 ' U- J L*J 1 WW 1 1 "T 1 WW 


521 


1777 PPRQ 


94Q1 

£43 1 


FCBBF301 14850 

1 VUUI WW | | *TUwv 


522 


7RQ 150^ 


940 P 


FCRRF3011 59*30 
i vuurou i 1 w.cou 


5P*3 
o.Cw 


417 755 


940*3 


FCBRF3011 RQPO 


5P4 


54°, 1 HOP 

OHO. . 1 OUt 


94Q4 
t4y*f 


FORRF*301 1 RR70 


59R 
0£0 

0£0 


RQ1 PR07 


^H^O 


FORRF*301 1 RRQO 
rwDDrOU 1 i ooau 


1 R4 ^.PRQO 
I Oh. .>£DoU 




FCRRF^OI 954R0 


597 


R7 1 QPR 


94Q7 


FORRFIOI P5RH0 


RPR 


1 R5 R70 


94QR 


FHRRF301 PfldPO 


59Q 


17RQ 91*30 


9AOQ 


FORRF*301PQ010 


5*30 


1 R4 19*3R 
l Oh.. I cOO 


9*;oo 


fprrf*301 *3041 o 


5*31 


1 1 4Q 1 R74 


9K01 


FORRF'301'305RO 


5*39 


15R 919*3 


9^09 


forrf^oi *3P050 

i ODD I ww IO^UjU 


5*3*3 


71 R 1 R54 
/ 1 0.. IOOh 


9ROQ 


FCRRF301 3PRR0 

1 wDDi ww 1 OL.UUU 


5*34 


RR 1051 

OO. . 1 UO 1 


9^04 


FPRRF*301 °,5RQ0 


OOO 


OH ^ -N.9AQQ 
^ 1 4..>t4C)J 


9KOK 


FPRRF*301 *3R9*30 


5*3R 


CM. .OOOO 


tCOUO 


fprrf^oi^rooo ! 


5*37 


R4R 9Q01 


9K07 


FPRRF*301 499Q0 


5*3R 


R7Q 1RRP 
o/ a., i od^ 


tCOUO 


FORRF'sOl 4P.550 


5*3Q 


111 *31Q1 
1 1 i.,oiy i 


9R0Q 


FORRF*301 45R70 




1 5*3*3 1 RRO 
I OOO.. I oou 


951 O 


FORRF*301 51 1 QO 


541 


Q74 1*319 

57/ H.. I O I C. 


9511 


FCRRF301 531 70 


54P 


1 R 9*307 

1 O. ,c.O\Jf 


951 9 


FCRRF301 57P70 


54*3 

340 


R4 ^^O^ 


951*3 
^.0 1 0 


FCRRF301fi17R0 


544 
ohh 


91 R5Q 


9514 

^O I H 


FCRRF301 6451 0 


545 

OHO 


5R1 *30*35 

OO 1 ..OUOJ 


9515 

■£0 I O 


FCRRF301 66PP0 


54R 

OHO 


17ft 4ftQ 

1 / O. .HOO 


951 R 
•CO 1 0 


FCRRF3016QPR0 


547 

34/ 


11 fi Q01 


9517 


FCRRF301 6QR70 


54R 

OHO 


109 407 

I Ut.,4U/ 


951 fl 


FCBBF301 70710 


549 


8.. 382 


2519 


FCBBF30171230 


550 


1735..2361 


2520 


FCBBF301 72330 


551 


2497..2952 


2521 


FCBBF301 73960 


552 


106..>3530 


2522 


FCBBF301 75350 


553 


2..721 


2523 


FCBBF301 77290 


554 


378.. 923 


2524 


FCBBF30179180 


555 


2382..>3452 


2525 


FCBBF301 79740 


556 


299..721 


2526 
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Table 1 (continued) 



5 



10 



15 



20 



25 



l*/IUiic name; 


OCU IU INU. Ul IIUOIUUUUU 


PnQitinn nf PH^ 


cpPi in MO of amino ariH 

OlvJ 1 Vj IMU. Ul all III IU aulU 




seauence 




seauence 


FPRRF^OI 817^0 
i wuijrou lull <j\j 


557 


43.. 351 




FPRRF301 Q4^70 




RQfl 1 057 


2528 


FCRRF301 94550 


559 


9. .1982 


P5PQ 


FPRRF301 Q5RQ0 




1 O. . 1 UOC 




fprrf^oi Q57no 


5fi1 

JO 1 


14 Q7fi 


C.OO I 


FPRRF^OI Q7R40 


5R9 


40R Q015 




FPRRF'VH QP,R70 

rwDDrOU 1 you / U 


5rq 


491 97QQ 

He. \ .,£.1 OO 


95^ 
^uoo 


FPRRF^0901 R90 


5R4 


1407 95R1 

1 *-rW / ,,tJU 1 


95^4 


FPRRF1091 991 n 


5R5 


IfifiQ 9509 


95^5 

tJOO 


FPRRF^n91 5940 


5fifi 


774 1 5ftfi 
/ / *f . . I ooo 


95^R 


fprrf^0990050 


5R7 

OO/ 


14Q1 900R 

I H-O I . ,<lUUO 


95^7 


ppRRPQn.9990,1 n 


ceo 
ODD 


117 R1 A 


^ooo 


FPRRF^n,99^1 1 n 


ooy 


R47 1 1 9Q 

04/ ..II £s7 


9R^Q 

cooy 


FPRRF9n,99'391 n 


O/ u 


141 RRO 

I *4 I . .OOU 




PPRRPRn99 c .Q'*n 


571 
O/ I 


1 RR 1 Q5R 
I oo. . i yoo 


9541 


PP RRP^n99RQ Afl 


Die. 


7Q A5Q 


ORAO 


ppRRP^nQ^nfii n 


O/o 


QQ A An 
yy. .tHU 


9*^4.9 
t.O*tO 


rL/DDrJUtODD/U 


EC? A 


H QQQ 071Q ! 

iooy..^/iy 


*iO'+4 


ppRRPQfiocoQRfi 


etc 

O/O 


91 A 951 A 


^040 


ppRRPQ.n,9ccoRn 


O/O 


97 -v9ARn 




PPRRPQn.9C.7Q7 


A77 
O/ / 


9 1 R7Q 

I O/O 




ppRRPQnoc.Qn^.n 


O/O 


1 nA 1 Rfin ! 


9RAR 




R70 

0/ y 


1 1 o..>^4y4 


^04y 


PPRRPQnof5n4Rn 


con 
OoU 


^o. .01 y 


occn 
iiOOU 


ppRRPQno«QnRn 


DDI 


ooo..yuu 


^OOl 


rODDroU^OOO 1 U 




440..00/0 


^OO^ 


PPRRPQn971 QQO 


coo 
ooo 


1 R7 1 7HA 


zCOOO 


rODDrou^/ooyu 


KRA 


C1 ->.OQ7A 


^004 


cr RRPonooonon 


OoO 


loo 1T01 


<i000 


CP RRCQnOOKQQn 

rODDroU^ooyoU 


CAR 

Ooo 


OOO OQ"7 


occc 
^OOO 


FrRRCQflOfl7Q/in 

rUDDroUto / y*HJ 


OOY 


1 Ooo. y 


9KK7 


PfRRP/innnnfii n 


coo 
OOO 


COO H1QH 

OOO.. 1 lo 1 


<iOOO 


PPRRPAnnm Q9n 

rODDrWUU i y^u 


RAO 

ooy 


7^^ 1 nno 

/ OO.. I UO/L 


cooy 


Pf^RRP/i nnnconn 

rUDDr4UUUOUUU 


con 
oyu 


O/ ..'WO 


^oou 


PPRRpc.nnn.nAi n 

rODDrOUUUU^ 1 U 


oy l 


you., i £OD 


9^R1 


PPRRPcnnnnfii n 

rODDrOUUUUO 1 U 


CQ9 


^R^ RQ7 
OOO. .Oa / 


9E.R9 


PPRRPAnnm «co 
rODDrOUUUl OOU 


oyo 


cfio irk; 
oo^..i OlO 




fp rrp5oooq5Q0 


£.04 

oyn- 


197 Q91 


95R4 


fp rrf50ooaq.50 


50.5 

oyo 


^ I DO. .tOtO 


95R5 


r t DrlH£UUUOU*fU 


oyo 


one Q-ioo 


95RR 
£OO0 


FFRRA9nnn7R90 


5Q7 

oy / 


1 Rfi RQO 

i ou. .oyu 


95 R7 


FEBRA200 18670 


598 


184.. 1077 


2568 


FEBRA20026820 


599 


103.. 1836 


2569 


FEBRA20027070 


600 


663.. 1736 


2570 


FEBRA20029620 


601 


565.. 1206 


2571 


FEBRA20031000 


602 


380. .2551 


2572 


FEBRA20031150 


603 


2925.. 3293 


2573 


FEBRA20031280 


604 


362.. 31 24 


2574 


FEBRA20031810 


605 


1093.. 1455 


2575 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO ID NO nf amino arid 




sequence 




sequence 


FEBRA20035200 


606 


2607.. 321 5 


2576 


FEBRA20035240 


607 


74.. 826 


2577 


FEBRA20038220 


608 


157.. 1305 


2578 


FEBRA20038330 


609 


161 2.. 2253 


2579 


FEBRA20038970 


610 


1279. .2811 


2580 


FEBRA20039070 


611 


967 1575 


2581 


FEBRA20039260 


612 


23.. 685 


95R2 


FEBRA20040230 


613 


1663 2076 

1 ww^w»4^»wr \J 


95R3 


FEBRA20040260 


614 


244.. 561 


2584 


FEBRA20040290 


615 


1488 2330 

1 r W w • • W w w 


2585 


FEBRA20040560 


616 


711. .1496 


25R6 


FEBRA20045380 


617 


81 ..398 


95R7 


FEBRA20046200 


618 


125 2062 


95RR 


FEBRA20046280 


619 


694 1017 


2589 


FEBRA20046510 


620 


859 2256 


2590 


FEBRA20057010 


621 


757.. 11 07 


95Q1 


FEBRA20063720 


622 


11R 1R7R 

1 to.. IO/ o 


95Q9 


FEBRA20076200 


623 


Q03 con 




FEBRA20078180 


624 


1517 1RRR 

I J I / 1 ooo 




FEBRA20078800 


625 


94 644 


95Q5 


FEBRA20080860 


626 


5R4 241 Q 


95Q6 


FEBRA20082660 


627 


114 17Q3 

I It.. It 9J 


95Q7 


FEBRA20083410 


628 


116Q 1561 


95QR 


FEBRA200 84750 


629 


RRR 1909 




FEBRA20086600 


630 


670 1407 


96nn 

C.UUU 


FEBRA20087550 

1 1— l-^l ir\4»w V/ W# www 


631 


1140 1814 

I 1 *+U . . lult 


9601 

£OU 1 


FEBRA20088610 


632 


315 R1R 

O 1 sJ. , O 1 o 


9609 

£OU£ 


FFRRA200RRR10 


633 
uoo 


1309 9091 


9AOQ 

£OUo 


FFBRA200QG1 60 


634 


1 739 




FEBRA20090220 

1 1— 1— J 1 Irlwwwwwwbw 


635 


1 06.. 2271 


9605 

£OUJ 


FFBRA20091620 


636 


159R 1Q35 


9fiOR 
cDUO 


FEBRA20092760 


637 


6Q4 1317 


9607 

£OU/ 


FEBRA20093270 


638 


2205 2507 


960R 

£OUO 


FEBRA20093280 


639 


9165 94R5 


960Q 


FEBRA200Q5410 


640 


967 647 


9610 


FEBRA20098040 

1 ^ L*/ 1 IribVv w WwTV 


641 


43 459 


9611 


FEBRA20099860 

1 l_ L-/ I Iri^wwwwWww 


642 


55. .573 


9619 


FEBRA20101410 


643 


3Q6 740 


9613 


FEBRA201 08020 


644 


181.. 492 


2614 


FEBRA20 108580 


645 


542. .901 


2615 


FEBRA20 115930 

1 L*wl l/vCv 1 1 www w 


646 


5R 14Q4 


9616 


FEBRA201 16650 


647 


178.. 573 


2617 


FEBRA20121200 


648 


972.. 1490 


2618 


FEBRA20121950 


649 


121 7.. 1723 


2619 


FEBRA20141980 


650 


644.. 955 


2620 


FEBRA201 50420 


651 


314..3124 


2621 


FEBRA20151750 


652 


52..>2299 


2622 


FEBRA201 63980 


653 


93.. 1223 


2623 


FEBRA201 70240 


654 


385.. 1632 


2624 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


i SEQ ID NO. of nucleotide 


Position of CDS 

■ vol 11 vl 1 U I l_y O 


O (Zvj^ 1 vJ tM^. \j\ ctllllilU aulU 




sequence 




seauence 


FEBRA201 72230 


655 


377 1363 


2625 


FEBRA201 73330 


656 


462 2378 


PRPR 


FEBRA201 75020 


657 


2040 2438 


PRP7 


FEBRA201 75330 


658 


62.. 520 




FEBRA201 77800 


659 


1Q18 PPRQ 


PRPQ 


FEBRA201 80510 


660 


350 889 

UU U < iUUJ 


pR^n 


FEBRA201 82030 


661 


341 745 


cUO I 


FEBRA201 87460 


662 


395 7^fi 

\J<J^J . , f ou 


PR^P 


FEBRA20191720 


663 


41 7.. 836 


PR?'* 


HCH0N1 00001 50 


664 


1Q7 R70 

1 17/ ..Of U 




HCH0N1 0001 660 


665 


Q1Q I^R 

1 J.. 1 ooo 


PR^^ 


HCHON20000870 

1 Iwl 1 w 1 DC V UVUU / w 


666 


44P 1 R*^7 




HCHON20002650 


667 


A 7QP 




HCHON2000271 0 


668 


1 RQ 1 4P.7 


&DOO 


HCHON20015050 


669 


5PQ 1 QQC 


PR^Q 


HEART1 000 1420 

i i ur\i ill w v i 


670 


4 14.7R 
*t.. Iff D 


^OH-U 


HEART1 0001 490 


671 


icq hoi 




H EART20009590 


672 


19RQ 1Q^R 
i cos. . i yoo 




HEART20019310 

i i i-r\i i i w i vv I \/ 


673 


47 1 RQ'* 


ORAL 


H EART20022P00 


R74 

Of *T 


11 1 ^7ft 




HEART20031680 


67^ 

U / O 


I UOO..OU 1 o 


^D40 


H EART20047R40 


R7R 


oyo. .tyy't 




1 1 1 — f\f 1 t C\J\J (JO 1 uu 


R77 
Of / 


I 0 1 . . O^O 




HFART200RPR7n 


R7R 

Of O 


HO -4 ftOO 
It).. 1 UzLzL 


OGAQ 


H h n Pfi 1 non 1 1 4n 


R7Q 
of y 


QUO..! 1 U^i 


z:b4y 


HHDPC20051 8^0 


fiflf) 


P1 APP 


^DOU 


HHDPC20081 P*30 

I II 1 LV 1 wCUvU 1 cuw 


RR1 

DO 1 


1^P P1QE. 




HHDPCP00RP7Q0 


RRP 


1 en ftnQ 

i OU..OUO 


occo 




RR*} 


1 AAC 1 771 
I H<+ O.. 1 ff 1 




HHDPCP00R81 RO 

1 II lUlvcUUOO 1 Uu 


RR4 

OOH 


OA A -*.Pft<*Q 


^ioo4 


HLUNGP00084R0 




1^7 1R1R 
I Of I O I o 




H LUNGP000QP6O 


RRR 
ooo 


1 UOJ.. 1 OOD 




HLUNG20009550 


687 


1fi fi^7 

1 o. .00/ 


P«R7 


HLUNGP001 01 ^0 

■ ■ LUIM \-AC.\J\J I U 1 Ju 


RRR 
ooo 


i npo 1A0n 

1 UOj.. 1 *fUu 




HI UNGP0011PR0 

1 ILvlMVJ£.UU 1 1 L.UW 


RRQ 


1 47Q 


often 


HLUNG20011440 


RQO 


ooq 1P07 

OuO. . 1 £.\J § 


dOuKJ 


HI UNGP0011480 

i inji>iut.uu i i tuy 


RQ1 


ATK 1 f^R7 
HO. . I OO/ 




H LU N G200 1 P 1 40 


RQP 
oat 


1 RR ROR 
I OO..OUO 




HLUNG20014590 




101^ ^PP41 

I v I O. ,?tt4 1 


Oflfil 
dOOO 


HLUNG20015070 


RQ4 


^77 1 QA^ 
Of / . . i y«+o 


Of\AA 

dooq 


HLUNG20015180 




4RR 1R0R 

HOO.. I OUO 


JLOOO 


HLUNG20020500 


696 


1232.. 1687 


2666 


HLUNG20020850 


697 


802.. 11 79 


2667 


HLUNG20021450 


698 


753.. 1109 


2668 


HLUNG20023030 


699 


2466.. 2855 


2669 


HLUNG20024050 


700 


651.. 1568 


2670 


HLUNG20025620 


701 


1424.. 1765 


2671 


HLUNG20028110 


702 


164.. 1996 


2672 


HLUNG20029420 


703 


160..774 


2673 
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Table 1 (continued) 



Clone name 

WlVI lv 1 lul 1 lv 


SEQ ID NO of nucleotide 


Position nf POQ 


ocu iu inw. oi amino aciu 




sequence 




seauence 

www Uwl Iww 


HLUNG20029490 


704 


135.. 563 


d\j § *+ 


HLUNG20030420 


705 


116 1Q0R 

l III,, l aUU 


dO 1 3 


HLUNG20030490 


706 


553 160R 


d\J 1 o 


HLUNG20030610 


707 


920 1465 

ecu . . i tuo 


PR77 


HLUNG20031620 


708 


604 lO^fl 


PR7R 
da to 


HLUNG20032460 


709 


PQ 1 cop 


p«7q 


HLUNG20033060 

1 * 1— ' 1 1 v^t- www W WW v 


710 


10RR <*PPR7 


dOOV 


HLUNG20033310 


711 


131Q 1RR4 


PRR1 
too I 


HLUNG20033350 

1 1 ^W> 1 ^ W4bW WW www V 


712 


1 035 5»PP05 


P«QP 


HLUNG20034970 


713 

f lw 


1 04R 1 747 

I UH-O. . 1 / Hi 


PAQ^ 


HLUNG20037140 

1 1 L. w? 1 tI mL w w W / 1 " \y 


714 


1 ^4P 1 RQR 
I Of*;.. I DaO 


OAQ4 


HLUNG20037160 


71 5 
/ i o 


1704 ^15P 
I /U*h .O 1 Od 


dOOD 


HLUNG20037780 


71 R 


1 1 QO 1 RR5 

i i yu., i odd 




HLUNG20038330 


717 


4^ 1PR4 
HOO.. ltO*f 


OCQ7 


HLUNG20041540 


718 


1 qq 1 


cDOO 


HLUNG200415Q0 


71 Q 


I OU..>^0*f I 


£ooy 


HLUNG20042730 


7P0 


icn 14R4 


OftQA 

£oyu 


HLUNGP0045340 


10 A 

1 d \ 


1 RQ KOQ 


^oy i 


H LU NG20047070 


7PP 


I O I U. . I D / d 


dOifd 


HLUNG20050760 


7P^ 


IR-tO OHO 


oeno 

<ioyj 


HLUNGP00R1 330 


7P4 




^by4 


hi unrpoosp^oo 


7PR 


oo.. l UUo 


2695 


hi UMGpnn^47Qn 




1/11 Q OCylQ 


ocne 
^byb 


hi UNGpnn^^P4n 


7P7 
/t / 




0C07 


hi i jni^poo^rrro 


7pq 

/ ^o 


A CIO 


2698 


HI UNGP00573R0 


7PQ 
/ dif 


^OQ 7QQ 


zibyy 


hi UNfiPon c »qp4o 


/ ou 


1 COQ 1 QVO 


2/00 


hi UNG?nfifififi7n 


7^1 
/ O 1 


/0O..101 / 


2/01 


HI UNfiP00fi1700 


7^P 
/ Od 


HOCft 1 ROC 
1 OOO.. IDOO 


2/02 


HI UNGP00R5700 


7*31 
/ oo 


QO Oi i 


2/uo 




7^4 


o*f*fr..l^ol 


2/U4 


HLUNGP0067R1 0 


7^R 


1 7R «QQ 


2/ Uo 


HLUNGP00681P0 


7^R 
/ oo 


OOo.. I do I 


2/Ub 


HLUNGP0Q6Q350 


7^7 
/ o/ 


1 qR 1 R71 

I SO,. I Of \ 


07A7 


HLUNGP007041 0 


71R 
/ oo 


474 QOQ 


2/ Uo 


HI LJNGP007P1 00 


7^Q 
/ oa 


1 R4 1 Q7q 


2/ uy 


HLUN1GP007P1Q0 


740 


■1 Qq4 1 7PQ 

i oy*f . . i / do 


dl 1U 


HI UNGP007P4^0 


741 


1P7 4RR 
I df ..^oo 


2/11 


HLUNG20074330 


74P 


1RP1 "s.iq7R 
i ad i iy/o 


P71 P 
^/ 12 


HLUNGPQ079310 


/ H-O 


1 4R4 PO^O 


2/ lo 


HLUNG200813 


744 


hoo iqm 

I ^O. . 1 a I U 


971/1 
2/14 


HLUNG20081530 


745 


1218.. 1889 


2715 


HLUNG20082350 


746 


313..2109 


2716 


HLUNG20083330 


747 


770.. 1138 


2717 


HLUNG20083480 


748 


183.. 1895 


2718 


HLUNG20083840 


749 


1389.. 1811 


2719 


HLUNG20083960 


750 


1214.. 1630 


2720 


HLUNG20084790 


751 


11 85.. 1745 


2721 


HLUNG20085210 


752 


138..779 


2722 
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Table 1 (continued) 



VvlUIIC 11(31 lit/ 


SFO ID NO of nnrlpntiHp 


Position of 

rUolllUM Ul uL/O 


obu iu i\u. or amino acia 




sequence 






HLUNG20088750 


753 


1 Ol 1 . , 1 UJJ 


^/ ilO 


HLUNG20092530 


754 


224.. 577 


P7PA 
^/ <cH 


HLUNG20093030 


755 


1836 PP4R 


P7PR 

<L / ^O 


HLUNG20094130 


756 


P1PQ PR.R.4 


070C 
£.1 £.0 


KIDNE200116Q0 


757 


1Q7 R0.1 

1 O / . .OU I 


0707 


KIDNE20016360 


758 


R1 P7QR 


P7PQ 


KIDNE20024380 


759 


1017 ^1QQ4 

IUI/i>?l OOH 


P7PO 


KIDNE20027980 


760 


ulj, . 1 OO 1 




KIDNE20080690 


761 


1 1R00 

I . . I OUU 


P7Q1 


KIDNE20081170 


7RP 


1 ^RR OAAA 
I OOD. .t*r*+*+ 




r\iL^ix t^uuoo i uu 


/ DO 


oDO.. 1 04t 


0700 


KIDNEPOORSfiPO 


7R4 


P1R 114P 
^ ID.. I I 


^/OH 




7RR 
/ DO 


on h C7o 


O70C 
41/ OO 


(AIL/IN LtUUCrVHU 


7RR 
/ DO 


Ol 0..O£O 


07QO 

dfoo 


KIDNE20084730 


7R7 


can pmi 

OOU. .cO 1 1 


P7Q7 
dt Ol 




7RR 
/ DO 


Q *5<JP 


07QQ 

d/oo 


1 X 1 J >J Li.UUOU*+i7U 


7RQ 
/ D57 


1 RP 1Q1Q 


^/jy 


KinNIFPOflRfififin 


770 
/ /U 


.. /UU 


^/hU 


KinNIFPfinRRQ7n 


771 
/ / 1 


pnp RAfi 

ilU^..O*+D 


^/4l 


KinNF?nnR7Rfln 


77P 
lie. 


RRR 1 nOQ 
DOO.. 1 UUO 


d/Hd 


K 1 DN FPn0flRP4.fl 


i to 


1 U 1 . . 1 1 JO 


dfOo 


KinNFPnnflQR7n 


77A 

1 IH 


d\c...\Oe. 1 




ixlUIN C^UUO 1 UOU 


77R 
/ /O 


ii/<s:..o/U 


dlHO 


KinMFpnnQApRn 


77R 
/ 10 




2746 


K 1 DN FPnOQ4R70, 


777 
III 


QQ7 POQO 


dlHI 


r\IL/l\l CcwU9J90U 


77Q 
/ lo 


io/o..> lo/o 


2/48 


KIDIMFPOITMRO 


77Q 

/ / O 


Pin siwfi 

^1 U..>1 OOD 


2/4y 


KIRN FP01 ^RRO, 


7RO. 
/ OU 


/ Ho. . 1 Uo/ 


2/oU 


KinNFPf)1^d1^n 
r\iL/i>j n.c.\j i oh i ou 


7R1 
lot 


i 7A 707 


2/01 


KinNFPOndRQn 


7RP 


QO/I QOQ 


07C0 

2/o2 


KinNFpnn7^m 

fNILJIM C£.\J lO/O IU 


/ oo 


OU4..0/D 


2/oJ 


K l D M FPfi 1 ^RA*>n 


7RA 
1 OH 


AQQ QOQ 

4yy..o^o 


2/54 


KinNFP0140R70 


7RR 
/ oo 


7Cn s^9HR 


2/oo 


KIDNFPni411Pn 


7RR 
/ OO 


1 QQ7 1 Q<lO 


2/ob 


KinNFPni4170f» 


7R7 
l Ol 


i PPR PPRR 
I O^O..^^OD 


07C7 

2/0/ 


KinNFPf)14PfiR0 


7RR 

/ OO 


7QR iPQQ 
/ OO.. I £ua 


07CQ 

2/Oo 


KinNFPfi14PQ00 


7RQ 

/ DO 


4R 7RA 
HO.. / OH 


07C0 

2/oy 


KIDNFP014^POO 


/ OU 


QRP 1 ZlRP 
oot.. IHOt 


2/DU 


KiniMFP0147170 


7Q1 


1 U I # . . i ny O 


2/Dl 


KIDNIFPOI 4R0R0 


7QP 


H HQ7 i R7K 

i uy i . . i o i o 


07M 

2/D2 


KinivlFPm 4Q7Rfl 


/yo 


OR7 1 Q7H 


2/bo 


KIDNE201 50730 


794 


1671. .1991 


2764 


KIDNE201 52440 


795 


3.. 1346 


2765 


KIDNE201 54330 


796 


422.2713 


2766 


KIDNE20154830 


797 


1588.. 1923 


2767 


K I DNE20 155980 


798 


1164..1595 


2768 


KIDNE20157100 


799 


90.. 1286 


2769 


KIDNE20160360 


800 


41 3.. 2692 


2770 


KIDNE201 60960 


801 


10..534 


2771 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



LrlUilc (loll It? 


cpn in NIO nf nuHpntirlp 


r UdlLIUM Ul UL/O 


otu iu nu. oi amino acio 




sequence 




coni lonro 
ocljUcI lUc 


k i nfj fpdT fiT7 1 h 

Fx 1 LJ\ \ VLC-\J 1 UJ / 1 u 


802 


1 OUJ.. 1 9 / O 


9779 


KinNlFPOIR^QO 




RR 9^1 


977Q 
C.I to 




R04 

Out 


IOC 9«MR 


£.1 /4 


KinNiFpm 7n4nn 


OUJ 




977C 
iLI /0 




R0.R 


CH "1 nco 

O 1 . . 1 UOt 


977ft 
^/ /O 


KiniMFpni7^^n 


R07 




9777 
£.1(1 






P4-1R 9R7Q 


O770. 
£./ fa 


KinNFpm flifi7n 

r\i lmn \z.c.\j i o i u / u 




1 AH R^Q 

i ou. .ooy 


9770 

#y 


KIDNIFP01 8P*540 


O 1 u 


1 qc 1 7/c 

1 OO. . 1 / HO 


97RH 


KinNFPniflfii7n 

r\iuiN C£.\j I oo i f u 


R11 

O 1 i 


RO. 7A.R 
OO../ HO 


97A1 


KIDNFPm RRR^O. 


PI 9 
O 1 tL 


CiQ QOC 

oi y. .y^o 


97Q9 


KinNFpm flQRQn 

r\iuiNc^u i oyoyu 


O 1 o 


1 004..^HH'0 


07QQ 

C.I oo 


k"mMFpni RQQRn 
rxiuiNc^u i oyyou 


R1/1 


^o4.. i bob 


^/ o4 


KinNiFpniQi R7n 


O 1 o 


RRH 1 Q79 


07QC 


I i\/FRpnnnfipfin 


R1 R 
O 1 o 


o/y.. lODo 


07QC 

ii/ob 


I i\/FRpnnn7RQn 

LI V CntUUU/ oyu 


m 7 

Ol / 


o^y../bU 


Q7Q7 


I ix/PR9nnn77*;fi 

LI V tn^UUU/ /OU 


Ol o 


1 /o.. io4y 


0700 

2788 


I i\/PR9fifii nRi n 
li ven^uu i uo i u 


oi y 


/o/.. i i4o 


2789 


i ivPR9nm o.7ftn 
li v tncuui u/ou 


o^U 


qc OOO 

yo..ooo 


2790 


i i\zPR9nni noon 
li v encuu i uyyu 


Del 


one h nno 

oUo..iuuy 


2/91 


Livtntuuii 04U 


B22 


■1 ^ Q7 OrtQC 

1 1 o/ ..^Uob 


2792 


LI V fcn2UU 1 ooyU 


O20 


4 coo o4 nn 
1 Oiio.. 2199 


2793 


I iv/PnonnoftAiin 

LI Vtn^UU£044U 


□o>i 

B24 


iUi9..l9oo 


2794 


i iv/PROfiAOAocn 

LI V t n^UUOUOOU 


o20 


h A en oooo. 
l4b9..^o9 


2795 


i I VP r 900090 An 


O20 


OH Q4 OCA/I 

£l o 1 ..2ou4 


2796 


1 ivpr90oqqooo 

LI VtntUUOoUUU 


Q97 

82/ 


TO A t\(\A 


2797 


1 ivpR9ooao7j1o 
li ven^uuHU/HU 


O^O 


1 90..1O/U 


2/98 


i i\/PR90oce97n 
li v t n^uuooii / u 


QOQ 


■1 A Q 4 OA~f 

14o..lo4/ 


2799 


IVI CortlN2UUuO<£UU 


ooU 


OCOft oooc 
^b9U..O^OO 


oo 

2800 


IVI CoMIn^UUU / \ 1 U 


ool 


H /I n-i H QOO 

14U1 ..l o^o 


OOA4 

2801 


IVI COHIN^UUUOl OU 


QQO 


lob..oob/ 


2802 


M pq an90ooroao 

IVI t oMINJ^lUUUoy^f u 


ooo 


l^..014 


OO AO 


mpqan90ooqogo 


OOH 


OA"? -tQQO 

^4/ .. 1 yy^ 


OOAjI 

2U04 


M PQAMonni ft.970 

IVI toMIN^UU 1 O^. / U 


ooO 


o4b..^Uoi 


2805 


IVI COAIM^UUt 1 1 OU 


bob 


104U..^b/b 


OO AC 

280o 


M PQAN90091 990 
IVI PoAIM^UU^ 1 c.c\} 


QQ7 

oo/ 




OO AT 

2o0/ 


MPQAM^nnoi A70 

IVICOMIN^UU^ 14/U 


QOQ 
OOO 


CCD 4>M£ 

b0o..l44b 


OQ AQ 

2o0o 


M fq AN9no9i rp,o 

IVI COMINtUU<l 1 O0U 


ooy 


^1 1 IIO 


oo An 
2o09 


M F^ AN900PRR7n 

IVI COnlM^UUcDO / U 


RAH 

ohu 


cq OAKCl 
00..^4OU 


9Qi A 
2o 1 U 


MPQAN9n0979AO 
IVI COMING U U^ / ^HU 


OH- 1 


oy..iy4u 


2o II r 


MPQAMPnn^7Qnn 


RAO 
OH^ 


919 QQ99 


OQ 4 O 

2ol2 


MESAN20029780 


843 


1900.. 2331 


2813 


MESAN20030350 


844 


142..>2239 


2814 


MESAN20030370 


845 


735.. 2462 


2815 


MESAN20030390 


846 


3..389 


2816 


MESAN20033220 


847 


68.. 478 


2817 


MESAN20034440 


848 


42.. 21 83 


2818 


MESAN20038520 


849 


31.. 2547 


2819 


MESAN20041380 


850 


7.. 342 


2820 
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Table 1 (continued) 



L/ione name 


oprj in MO nf nnrlfintiriA 
seauence 




SEQ ID NO. of amino acid 
sequence 


MF^ANPnn4R7Rn 


RR1 


145.. 1002 


2821 




MP*? ANPnnRfiftQ.fi 
ivi conivtuujuoau 


RRP 


1R7 11 PR 


2822 




M F<^ ANPfinR7P4fi 


RR^ 


101 ..778 


2823 




MF^ANPnnRftim 

IVlCOnlM^UUUO 1 IU 


HRd 

OOH 


174 10P? 


2824 




M F^ANPfifiRQ^fi 


RRR 


pfln 17AP 


2825 




m f<5 ANpnnfinppn 


ooo 


77R 


2826 




m f^ Afvionnfinyio.n 

IVI EoAINtUUOU*fOU 


RR7 
oo/ 


1 RRQ ^4fiQ 

I 90a.,OH09 


2827 




M FQ ANPnfifiRQQfi 
IVI COAINtUUOOyyU 


RRP 
OOO 


OHO COO 
£ I 0..000 


2828 




M F 5 ? AM9nnR7A^n 
IVI COnlN^UUDf HOU 


RRQ 


1 PR7 1 Q7R. 

1 lO / . . 1 57/ O 


2829 




IVI EOAIN^UUOyOOU 


oou 


OO0..>O*+/ 


2830 




MPQAMonnazi Rn 

IVI boAIN^UUOH 1 OU 


OO 1 


7R fifiR 
/ 0. .OOO 


2831 




IVI toAIN^UUoOOOU 


qro 


1 1 ftp i fiRfi 
1 1 OO.. 1 OOO 


2832 




ivi toArN^uucjy^ou 


OOO 


ioy.. 1^04 


2833 




m FQAMonnom on 

IVI boAIN^UUy U i y u 


OD*f 


ORR PRQR 

^oo. .toyo 


2834 




M FQAMonnQAi Rn 

IVI bOAIN£UUy4 1 OU 


OOO 


1 RQft 01 RO 

i oyo..^ i oy 


2835 




KA cc A Monnocoon 
Ivi boAIN^UUyo^U 


□ CD 

ODD 


HiQ 1 Q70 

nio..iy/^ 


2836 




MrcA Monnoeonn 
IVI boAIN^uUyDOUU 


ob/ 


04 -H Q7 

ol ..llo/ 


2837 




IN boUr <iuuu40£U 


OOO 


•1 no i Ki O 

i uy..io i^ 


2838 




IN toUr £UUUDU4U 


oby 


Q4c h ono. 
o lb..louo 


2839 




In 1 £lNb£UUl o/4u 


ovu 


1 410..1 / 1 c. 


2840 




in i iirvitiiuui ooyu 


Q71 
O/ 1 


on A ncH 

oy. . 1 uoi 


2841 




IN 1 £iNh£UU£l OOU 


Q70 
0/£ 


<ibo.. IbOo 


2842 




In 1 ^INb^UUZo^UU 


070 

o/o 


i ooo OCKY7 

1 ooo..^yu/ 


2843 




In 1 ZNcdUvdob 1 0 


o/4 


OOQ HOQ1 

OOO.. 1001 


2844 




In 1 ^Nb^uu^o/uu 


Q7C 

o/b 


H CC H 077 

l bb.. 10/ / 


2845 




In 1 <£IN b^uuoo 1 ou 


Q7C 

o/b 


4yb..y 1 £. 


2846 




In 1 ^Nb^Uuo/uoU 


Q77 
O/ / 


HQ /I i -4 


2847 




IN 1 ^Nb/iuOooo/u 


O/O 


1 yu..io4o 


2848 




IN 1 ^Nb^Uuoy/i l U 


Q7Q 

o/y 


00c coo 

^b..boy 


2849 




In 1 ^Nb^uu4*£obu 


OOU 


7nO 1CCQ 

/Uo..lObo 


2850 




In 1 ^lNb<iOU4ol yu 


OO I 


OO CQQ 

00.. oyy 


2851 




IN 1 *:INbiiUu4/o/U 




OH/I non7 


2852 




MTOMFOnnRROQn 
IN 1 ZINb^UUDO^OU 


RRR 


D£^.. 1 UOO 


2853 




in I ^fNb^uUooyou 


004 


■i Q*5 QQQ 

1 oo..yyo 


2854 




in i *iiNb*iuuoy*i i u 


QQC 
OOO 


QR 

00. .oy 1 


2855 




IN 1 ^Nb^UUoyboU 


ooo 


h on /i/i 0 

1 ^U..440 


2856 




IN 1 cIN C£UUOU f OU 


QQ7 

00/ 


OOQ QOQ 

^oy. .y<£o 


2857 




MTOMConnfti no,n 
in i ^iNb^uuo i uou 


rrr 
Ooo 


1 «n «nn 

i DU..OUU 


2858 




IN 1 C.YH CilUUOilOOU 


RRQ 
ooy 


pni r^q 

^U I ..009 


2859 




IN 1 ^(Nb^UU04/DU 


nan 
oyu 


PAA 1771 
tOI.. 1 / / 1 


£ObU 




in i ^iNb^uuoooyu 


QQH 

oy i 


RP1 1Pfl4 


2861 




NT2NE20069580 


892 


1404.. 2282 


2862 




NT2NE20070520 


893 


347.. 661 


2863 




NT2NE20073650 


894 


756.. 1088 


2864 




NT2NE20077250 


895 


1815..3185 


2865 




NT2NE20077270 


896 


250..>3642 


2866 




NT2NE20077860 


897 


589.. 1053 


2867 




NT2NE20079670 


898 


404.. 1867 


2868 




NT2NE20080770 


899 


1181 ..1483 


2869 _ 
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Table 1 (continued) 



5 



10 



15 



20 



25 



oione name 






otu iu inu. or amino dcia 




seauence 




Qpnnpnrp 


NTPN FPfiOflP 1 ^0 

IN 1 ^INL^UUO^ 1 JU 






Pfl7n 
C.OIVJ 


NTPN Fpnoflpfinn 


Q01 


fiRfl 1PP7 


PP.71 
^O / 1 


MTPN FPfinflfifi70 

IN 1 ^INLtUUOQU / U 


QOP 


CflQ QOO 


Pft7P 


NTPN FPf)flft7P7fl 




P7R 1 70.Q 


0Q70 
2O/0 


IN 1 ttMCtUUO/ OuU 




y..o^o 


20/4 


ntpn FPnnflRn^n 


one. 
yuo 




OQ7K 
2O/0 


mtpn Fpnnopo^n 

IN 1 ^iNC^UUy^yOU 


yub 


1 4t.. 1 004 


OQ7C 

2o /b 


NTPN FPflOQ^P^O 


yu/ 


Of / .. 1 QUt 


20/ / 


NTPNFPm nAnnn 

IN 1 t IN C£.\J 1 UtUUU 


yuo 


P17 7no 


20/0 


mtom pom n.7fli n 

IN I ^IN b£U I U / O I u 


QOQ 

yuy 


OQ 07C 


OQ.7Q 

2o/y 


IN I ^IN b^U I uo4£U 


y i u 




2ooU 


in. i *lI\ic^.u i ii I yu 


y 1 1 


ono fti^i 


OQQ4 

2oo l 


IN I £lNb<lU I l I U 


Q10 

yi2 


QCQ OOQ1 


OQ QO 

2oo2 


MTPMFPnil AACft 
In I ^INC^U I t h-oOU 


Q1 O 

y io 


1 AA4 ^^7QA 
1 44 1 . .> 1 /y4 


OQQO 

2oo0 


IN 1 2lNb2Ul 1 /OOU 


yi4 


07i QHO 

^/ 1 ..yi^ 


2oo4 


In 1 2I\I b2Ui 1 yyoU 


Q1 C 

y io 


/ME Q/IO 

440..o4o 


OO OC 

2ooo 


IN 1 2lNb2U l^OOl U 


y i b 


1 1 0../1 / 


OO OC 

2oob 


MTOM COfH O/l K7n 
In 1 2iNb2vJ 1240/U 


y i / 


4 o7Q ^ eon 


OO Q"7 

2oo/ 


In l2Nb2U12bUoU 


y i o 


1 0o4.. 104o 


2ooo 


in i2Nb2ui2/yuu 


Qi Q 

yi y 


or 4 CQQ 

00..1 oyy 


oo on 

2ooy 


In 1 2lNb2U14U1 OU 


oon 
y2u 


OOO.. /OO 


ooon 

2oyu 


l\l 1 2Nb2Ul4U2oU 


Q04 

y2i 


H oo non 

i/?b..yoy 


2891 


MTOMCom A H C\A(\ 

N 1 2N b2u 1 4 l U4U 


ooo 

y22 


CO OHH 
OO. .Oil 


2892 


MTO M COn 4 A COC H 

In 1 2iNb2U l4o2oU 


y2o 


nvi coh 
y4..bo 1 


2893 


In 1 2lNb2U l4bo 1 U 


y24 


i OC CC4 

i<ib..bb i 


2894 


mtom com yi QRon 
In 1 2lNb2U l4obyu 


QOC 

y2b 


b44..obl 


oonc 
2oyo 


mtom torn /iQcnn 
In 1 2lNb2U1 4yoUU 


y2b 


ACA OOO 


oonc 

2oyo 


mtom corn cnei n 
In l2Nb2UloUblU 


007 

92/ 


73.. 432 


2897 


MTOM COOH COfiOft 

In 1 2fNb2U io2b2U 


QOQ 

920 


h non och a 


2898 


mtom conn cofton 
In 1 2lNb2Uloob2U 


QOQ 

y^y 


H OO iCiO 

l oo. . io 12 


oonn 
2899 


MTOMCOrtH ccccn 
In 1 2rNb2Ul OObOU 


QOQ 

you 


/l b.. 12 1 0 


onnn 
2900 


MTOM COni R7i Ort 

In l2lNb2Ulb/12U 


QOH 

yoi 


QOT H 0"7H 

y2/.. i2/ 1 


oon A 


in i *iin b^u i do i yu 


QOO 

yo2 


QC Ml 
00..001 


OQQO 

2yU2 


In 1 2l\Jb2U I b/bbu 


QOQ 

yoo 


on OAQ 
2U..o4y 


OQrtO 

29Uo 


MTOM POni 70Q7fl 

in i 2iNb2U i /oy/u 


yo4 


07>1 -^O-i OQ 

2/4. .>2l oo 


2yU4 


MTOM Com 770i O. 

In 1 2IN b2U 1 / / 2 i U 


yjo 


1 1 o..b/o 


oonc 

2yuo 


mtom con "1 fli 7cn 
In l2lNb2Ulol /ou 


yjb 


Oni 7QC 

2U1 .. /OO 


oonc 

2yub 


MTOMFoni jh onn 
In 1 2lNb2Ul ol oUU 


QQ7 

yo/ 


/04..I 10/ 


OQH7 

2yu/ 


MTPNFPH1 ftA7Pfi 


yoo 


I UOO.. I ouo 


OQrtft 

2yuo 


MToQionm ROAn 
In l 2ril^uu i b^4U 


QOQ 

yjy 


OCQ 7>1o 

ooy.. / 4o 


OQDQ 

2yuy 


In l c.r\\z\j\jd. l *iuu 


y4u 


O04. .o/O 


OQ1 A 

2y i u 


NT2RI20033920 


941 


269.. 1444 


2911 


NT2RI20093010 


942 


1478.. 1816 


2912 


NT2RP70001120 


943 


172.. 1518 


2913 


NT2RP70001730 


944 


166.. 1935 


2914 


NT2RP70003110 


945 


15.. 1868 


2915 


NT2RP70012830 


946 


280.. 2337 


2916 


NT2RP70022820 


947 


2819..>3760 


2917 


NT2RP70027790 


948 


389.. 2965 


2918 
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Table 1 (continued) 



5 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


NT2RP70029780 


949 


156.. 1349 


2919 


NT2RP70030840 


950 


278.. 2983 


2920 


NT2RP70031070 


951 


148.. 1740 


2921 


NT2RP70031340 


952 


107.. 1576 


2922 


NT2RP70031480 


953 


203.. 2947 


2923 


NT2RP70035110 


954 


106. .765 


2924 


NT2RP70046410 


955 


155. .1582 


2925 


NT2RP70049610 


956 


834.. 1889 


2926 


NT2RP70056290 


957 


2748.. 31 82 


2927 


NT2RP70056690 


958 


253..>3053 


2928 


NT2RP70057500 


959 


216. .2615 


2929 


NT2RP70064570 


960 


636 2162 

www * • £_ 1 w^ 


2930 


NT2RP70074800 


961 


199 684 


2Q31 


NT2RP70075300 


962 


226 1539 

^•»«— W * i 1 WWW 


2932 


NT2RP70075800 


963 


211 3801 

1 1 • •www 1 


2933 


NT2RP70080150 


964 


306 674 


2Q34 


NT2RP70084540 


965 


8. .406 


2Q35 


NT2RP70087140 


966 


175.. 1074 


2Q36 


NT2RP70090870 


967 


272. .261 7 


2937 


NTONG20002230 


968 


50 >3211 




NTONG20005310 


969 


88. .471 


2939 


NTONG20017620 


970 


40.. 432 


2Q40 


NTONG20029850 


971 


110 1258 




NTONG20031580 


972 


126 863 




NTONG20032100 


973 


65. .1132 


2Q43 


NTONG20034540 


974 


1fi3 2523 




NTONG20035150 


975 


543.. 2477 


2945 


NTONG20043080 


976 


22. .2679 




NTONG20048440 


977 


89.. 1849 




NTONG20049180 


978 


155 1300 

1 WW* ■ 1 Www 


2Q4R 


NTONG20053630 


979 


321 3821 


9Q4Q 


NTONG20053730 


980 


121. .1701 


9Q50 


NTONG20053910 


981 


47 .2758 


2951 


NTONG20055200 


982 


122 1969 


2952 


NTONG20058010 


983 


237.. 1559 


2953 


NTONG20058220 


984 


67.. 1314 


2954 


OCBBF20000740 


985 


165.. 2402 


2955 


OCBBF20001780 


986 


1990.. 2334 


2956 


OCBBF20005220 


987 


590.. 2299 


2957 


OCBBF20009820 


988 


281.. 823 


2958 


OCBBF20011860 


989 


145.. 528 


2959 


OCBBF20012520 


990 


786.. 3023 


2960 


OCBBF20016390 


991 


667.. 1617 


2961 


OCBBF20016810 


992 


123.. 2351 


2962 


OCBBF201 09450 


993 


204.. 506 


2963 


OCBBF201 09780 


994 


253.. 573 


2964 


OCBBF201 10210 


995 


8.. 1072 


2965 


OCBBF201 10730 


996 


1346. .1666 


2966 


OCBBF20111370 


997 


296.. 1345 


2967 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


OCBBF20111600 


998 


375.. 1823 


2968 


OCBBF201 12280 


999 


447.. 860 


2969 


OCBBF201 12320 


1000 


72. .857 


2970 


OCBBF20113110 


1001 


292.. 738 


2971 


OCBBF201 15360 


1002 


1182. .2126 


2972 


OCBBF201 16250 


1003 


124., 21 75 


2973 


OCBBF201 17220 


1004 


605.. 937 


2974 


OCBBF201 18720 


1005 


131. .451 


2975 


OCBBF201 19810 


1006 


1483.. 2478 


2976 


OCBBF201 20010 


1007 


216.. 1106 


2977 


OCBBF201 20950 


1008 


91. .1629 


2978 


OCBBF20121910 


1009 


235.. 2580 


2979 


OCBBF201 23200 


1010 


1046. .1411 


2980 


OCBBF201 42290 


1011 


94.. 900 


2981 


OCBBF201 47070 


1012 


11 93. .3256 


2982 


OCBBF201 52330 


1013 


284. .613 


2983 


OCBBF201 55030 


1014 


188. .724 


2984 


OCBBF201 56450 


1015 


104.. 11 32 


2985 


OCBBF201 57970 


1016 


1390 2313 


2986 


OCBBF201 60380 


1017 


1176 3884 


2987 


OCBBF201 65900 


1018 


407.. 934 


2988 


OCBBF201 65910 


1019 


1260 2495 


2989 


OCBBF201 66890 


1020 


201 ..1121 


2990 


OCBBF201 66900 


1021 


350 >2606 


2991 


OCBBF201 67290 


1022 


186 2858 


2992 


OCBBF201 70350 


1023 


8.. 499 




OCBBF201 74580 


1024 


179.. 1294 


2994 


OCBBF201 74890 


1025 


178 1380 


2995 


OCBBF201 75360 


1026 


1394 1711 

1 Ww". .1/11 




OCBBF201 76650 


1027 


121 ..423 


2997 


OCBBF201 77540 


1028 


334.. 1242 


2998 


OCBBF201 77910 


1029 


1047 1688 


2999 


OCBBF201 82060 


1030 


964.. 1827 


3000 


OCBBF201 85630 


1031 


1839.. 21 95 


3001 


OCBBF201 88280 


1032 


322..696 


3002 


OCBBF20191950 


1033 


47.. 2305 


3003 


PANCR1 0000860 


1034 


20.. 376 


3004 


PEBLM1 0001470 


1035 


155.. 1960 


3005 


PEBLM20001800 


1036 


59.. 1549 


3006 


PEBLM20003260 


1037 


660.. 1121 


3007 


PEBLM20005020 


1038 


315. .647 


3008 


PLACE50001290 


1039 


620.. 1030 


3009 


PLACE50001390 


1040 


156.. 1376 


3010 


PLACE60001910 


1041 


1521.. 2252 


3011 


PLACE60004260 


1042 


562.. 924 


3012 


PLACE60006300 


1043 


12.. 725 


3013 


PLACE60011180 


1044 


526.. 894 


3014 


PLACE60012620 


1045 


106.. 1368 


3015 


PLACE60017120 


1046 


1116..1454 


3016 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO ID NO of amino arid 




sequence 




sequence 


PLACE60052940 




385..>2108 


3017 


PLACE60053280 


1048 


161..>2687 


3018 


PLACE60054230 


1049 


189.. 1550 


3019 


PLACE60054820 


1050 


298.. 852 


3020 


PLACE60054870 


1051 


105 2213 


3021 


PLACE60055350 


1052 


1188 1562 

1 1 WW** 1 


3022 


PLACE60055460 


1053 


160 1197 


3023 


PLACE60055590 


1054 


123 1406 


3024 


PLACE60056910 


1055 


1028 1549 


3025 


PLACE60057860 


1056 


1617 1940 


30P6 


PLACE60061370 


1057 


1 ..1899 


30P7 
out/ 


PLACE60062660 


1058 


97.. 954 


3028 


PLACE60062870 


1059 


604 1278 


30PQ 


PLACE60063940 


1060 


459.. 797 


3030 


PLACE60064180 


1061 


167.. 1240 


3031 


PLACE60064740 


1062 


302 Q5P 


OUOt 


PLACE60066970 


1063 


6Q7 1QQ8 


ouoo 


PLACE6006871 0 


1064 


OUO. . 1 1 0\7 


^0^4 


P LAC E6006 9880 


1065 


QQQ 


oUOO 


PLACE60070500 


1066 


765 1718 

/ UJ,. I / 1 o 


^0^6 


PLACE60071800 


1067 


689 1 3^3 


3037 
ouo / 


PLACE60072390 


1068 


145 5?? 

1 to . . occ 


^098 


PLACE60072420 


1069 






PLACE60073090 


1070 


QR4 1814 




PLACE60074820 


1071 


qq 707 


^041 


PLACF60077870 


1072 


PP80 PfiOQ 


*304P 


PLACE60080360 


1073 


OQC QQQ 
£OJ, .900 


^043 


PLACE60081P60 


1074 


1 1 QP 1 RfiO 


Qrj/lyl 
OU'H 


PLACE6008P850 


1075 


665 11^P 


^045 


PLACE60087680 


1076 


3 836 

O.. OOU 


OUM-D 


PLACE60088240 


1077 


94 570 


^047 


PLACE60092280 


1078 


180 86Q 


^048 


PLACE60092370 


1079 


371.. 691 


^04Q 


PLACE60093380 


1080 


433 1 335 


^05n 


PLACE60095240 


1081 


Q56 131P 


<*051 

OUO 1 


PLACE60095600 


1082 


4 >2605 


^05P 


PLACE60098350 


1083 


86 185P 


^05^ 


PLACE60 104630 


1084 


1267 1671 


3054 


PLACE601 05680 


1085 


1290 1631 


3055 


PLACE601 07010 


1086 


128. .829 


3056 


PLACE60 109910 


1087 


4.. 567 


T.057 

ouo / 


PLACE601 13340 


1088 


127.. 1518 


3058 


PLACE601 18810 


1089 


101. .1657 


3059 


PLACE601 19700 


1090 


199..516 


3060 


PLACE601 20280 


1091 


61. .705 


3061 


PLACE601 22970 


1092 


861..>1396 


3062 


PLACE601 32200 


1093 


240.. 1082 


3063 


PLACE601 32320 


1094 


471. .1121 


3064 


PLACE601 32880 


1095 


495.. 1406 


3065 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO of amino arid 




sequence 




sequence 


PLACE601 38840 


1096 


729.. 1553 


3066 


PLACE601 40640 


1097 


840. .1919 


3067 


PLACE601 50510 


1098 


215.. 1246 


3068 


PLACE601 54450 


1099 


25.. 873 


3069 


PLACE60155910 


1100 


255.. 701 


3070 


PLACE601 57310 


1101 


1156 1482 


OU / 1 


PLACE60162100 


1102 


885 1226 


307? 

OU 1 c 


PLACE601 75640 


1103 


242.. 556 


1071 
OU / o 


PLACE601 77880 


1104 


104.. 1327 


3074 


PLACE601 77910 


1105 


59.. 442 


107*> 
OU / o 


PLACE60181870 


1106 


131.. 673 


10 7R 

OU / o 


P LAC E60 184410 


1107 


QQ5 1481 


1077 
ou# / 


PLACE601 84870 


1108 


86 >1402 


107fl 

OU / O 


PLACE601 88630 


1109 


571 1395 


107Q 

OU / 57 | 


PROST1 0001 100 


1110 


716.. 1219 


10R0 
ouou 


PROST1 0001 360 

1 1 t\*S**J 1 1 www 1 www 


1111 


IKOfi 1QQ7 

1 JUL)., 1 C7i7 1 


10R1 
OUo 1 


PR0ST1 00021 50 


1112 


840.. 2345 


10RP 


PROST20007170 


1113 


?1fi 1R7Q 


OUOO 


PROST20007600 


1114 


pq 7/p 


IflR/l 
OUO*f 


PROST20011160 


1115 




OUOO 


PROST20011800 


1116 


1PQ7 1RPQ 


10RR 
OUOO 


PROST20014140 

1 1 IWw 1 b ww 1 "T 1 TV 


1117 


1001 io?? 


OUO/ 


PROST20014150 


111ft 

I 1 ID 


■IROp 1QR1 
I DJc.. I oD I 


OUOO 


PROST200 14650 

1 1 1 1 t w^ w w 


1119 


1 P41 1 R51 i 


lORQ 

OUU9 


PROST200 15210 


1120 


711 14QQ 


ouyu 


PROST20015400 

1 1 1 V^/ | t. ww 1 w/^Tw^w 


1121 


4Q QQQ 


ouy i 


PROST20016760 

I l 1 £.ww f w r \J w 


1122 


fi? 1 R7n 
U£. . 1 0 / u 




PROST20022120 


1199 


10 1 7RP 


onnq 

ouyo 


P ROST20024250 


11P4 


OO. .tHO 


ouy4 


P ROST20028Q70 


11P5 


OU. ,0<l 1 


OUaO 


PROST20033240 

1 1 IwW 1 ^www>w>^~w^ 


1126 


QQ I fiQO 
03. ■ 1 UOc 


lOQR 


PROST20035170 


1127 


6RQ 107fl 


10Q7 

ouy/ 


PROST20035830 


1128 


1154 1468 


10QR 


P ROST20036280 


1129 


1R01 PPQR 


moo 
ouyy 


PROST20036350 

1 1 IWw 9 ^ W ww'w^w'ww 


1130 


22.. 21 75 


n no 

O 1 uu 


PROST20039300 


1131 


87. .635 


11 m 

O 1 U I 


PROST20041460 


1132 


4P7 qcc 


11 np 

O 1 Kid. 


PROST20042700 


1133 


461 1081 

HU 1 . . 1 UO 1 


11 01 
O 1 UO 


PROST20045700 


1134 


472,. 975 


11 04 
O 1 u*+ 


PROST20047440 


1135 


1058 1441 


11 0^ 

O 1 UO 


PROST20048170 


1136 


73.. 555 


11 OR 

O 1 UO 


PROST20050390 


1137 


1358..2035 


3107 


PROST20051310 


1138 


1913..2737 


3108 


PROST20052720 


1139 


1195..1515 


3109 


PROST20052850 


1140 


430.. 11 85 


3110 


PROST20054660 


1141 


55..753 


3111 


PROST20058860 


1142 


506.. 892 


3112 


PROST20060200 


1143 


103.. 429 


3113 


PROST20062820 


1144 


415.. 1215 


3114 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEO ID NO of amino acid 

WU\K 1 w 1 ^ • w 1 CU 1 111 J w uUlU 




sequence 




sequence 


PROST20063430 


1145 


126.. 1199 


3115 


PROST20065100 


1146 


95.. 1522 


3116 


PROST20065790 


1147 


225.. 2555 


3117 


PROST20073280 


1148 


891. .1277 


3118 


PROST20075280 


1149 


97.. 885 


3119 


PROST20078710 


1150 


347.:1213 


3120 


PROST20082430 


1151 


2077 2436 


91?1 


PROST20084470 


1152 


728.. 1498 


3122 


PROST20084680 


1153 


395.. 733 


3123 


PROST20084720 


1154 


1528 1839 


3124 


PROST20087240 


1155 


766 1452 

/ WW ■ ■ 1 ^Tw£*. 


3125 


PROST20093470 


1156 


1288 1647 


3126 


PROST20094000 


1157 


376 1011 

w / w ■ t 1 V 1 I 


91 ?7 


PROST20097310 


1158 


1502 1903 

1 WWk« . 1 www' 


91 ?a 


PROST20097360 


1159 


99.. 602 


3129 


PROST20097840 


1160 


687 128Q 


91 90 


PROST20099090 


1161 


85.. 792 


OIOI 


PROST20102190 


1162 


86.. 406 


919? 


PROST20 102500 


1163 




91 99 


PROST20 103820 


1164 


1770 2078 


91 94 

O 1 OH 


PROST201 05450 


1165 


1027 1911 


3135 


PROST20 106060 


1166 


9 7.. 705 


91 98 
O 1 oo 


PROST201 08850 

1 1 Iww 1 ^w 1 vUUv w 


1167 


117*3 1884 

1 1 f O.. 1 OOH 


91 97 
O I O/ 


PROST201 10120 


1168 


804 1 1 9Q 


91 98 


PROST20114100 


1169 


74 77? 


91 9Q 
o i oa 


PROST20 120070 


1170 


44? 15Q9 


91 4H 


PROST20121570 


1171 


673 1185 


9141 


PROST20 122490 


1172 


10.. 753 


914? 


PROST20 124000 


1173 


943.. 1371 


9149 
o i *+o 


PROST20 125420 


1174 


1480 1Q68 


9144 


PROST20 127450 


1175 


255 857 


9145 


PROST20 130320 


1176 


549 290Q 


91 48 


PROST20 138730 


1177 


494.. 829 


3147 


PROST20 146590 


1178 


618 ?951 


9148 
o I to 


PROST20151370 


1179 


99.. 482 


91 4Q 


PROST20 152510 


1180 


1095.. 21 68 


3150 


PROST20 152870 


1181 


258 1322 

CwU,. 1 Wbb 


3151 


PROST20 155370 


1182 


606 2255 

W w W » »g r , ww 


31 52 


PROST20 156360 


1183 


1557.. 1898 


3153 


PROST20 159320 


1184 


67. .567 


3154 


PROST20 168600 


1185 


336 1700 

www • • 1 r ww 


3155 


PUAEN10000650 ■ 


1186 


341.. 943 


3156 


PUAEN 10000870 


1187 


1676.. 2029 


3157 


PUAEN10001640 


1188 


332.. 1138 


3158 


PUAEN20000800 


1189 


327.. 1811 


3159 


PUAEN20001520 


1190 


155.. 1243 


3160 


PUAEN20002470 


1191 


3..2417 


3161 


PUAEN20003120 


1192 


93..2216 


3162 


SALGL1 0001 070 


1193 " 


89.. 997 


3163 
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Table 1 (continued) 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO of amino acid 




sequence 




sequence 


SKMUS20006790 


1194 


389.. 1330 


3164 


SKMUS20007260 


1195 


47..>1320 


3165 


SKMUS20008730 


1196 


91. .1407 


3166 


SKMUS20017400 


1197 


84.. 815 


3167 


SKMUS20020770 


1198 


179. .781 


3168 


SKMUS20026340 


1199 


57.. 1202 


3169 


SKMUS20040440 


1200 


27.. 1091 


3170 


SKMUS20064810 


1201 


91. .459 


3171 


SKMUS20073150 


1202 


17..>822 


3172 


SKMUS20073590 


1203 


438.. 824 


3173 


SKMUS20079150 


1204 


165 1235 


3174 


SKMUS20091900 


1205 


57. .359 


3175 


SKNMC10001230 


1206 


87. .1814 


3176 


SKNMC20006350 


1207 


326 1738 


3177 


SKNSH10001010 


1208 


1189 1530 

1 1 WW* a 1 WWW 


31 78 


SKNSH20007160 


1209 


11 3. .562 


31 7Q 


SKNSH2000971 0 


1210 


71 ..622 


31 RO 


SKNSH20030640 


1211 


1635 20QQ 


31 R1 


SKNSH20040390 


1212 


246. .911 


31 ftp 


SKNSH20052400 


1213 


926 2017 


31 A3 


SKNSH20057920 


1214 


369 1352 


3184 


SKNSH20068220 


1215 


504 1235 


31 R5 


SKNSH20094350 

WlxllWI IbWWW^TWww 


1216 


70 5Q1 


31 Rfi 
O 1 oo 


SMINT20000070 

wlllll H 1 £_ v W W W \J i w 


1217 


75? 1 RSR 

/ . . 1 OJO 


31 R7 

O 1 Of 


SMINT20002320 


1218 


607 1761 


31 RR 
O 1 oo 


SMINT20006020 


1219 


1215 234R 


31 RQ 

O 1 Oa 


SMINT20006090 


1220 


75.. 602 


3190 


SMINT20007470 


1221 


154 1650 


31 Q1 


SMINT20008110 


1222 


7R4 1557 

/ OH. . 1 %JsJI 


31 Q9 


SMINT20011830 


1223 


34. .447 


31 Q3 


SMINT20011950 


1224 


732 2006 


31 Q4 


SMINT20012220 


1225 


239 616 


31 95 


SMINT20013970 


1226 


11 17.. 1686 


3196 


SMINT20014610 


1227 


287.. 814 


31 Q7 


SMINT20016150 


1228 


91 1 1 675 


3198 


SMINT20017310 


1229 


1220 1561 


3199 


SMINT20021260 


1230 


1066 1407 


3200 


SMINT20023110 


1231 


1696 2073 


3201 


SMINT20024140 


1232 


31. .555 


3202 


SMINT20026200 


1233 


323 2170 


3203 


SMINT20028800 


1234 


476 1696 

§ w • ■ 1 Www 


3204 


SMINT20028840 


1235 


89. .823 


3205 


SMINT20030740 


1236 


172.. 1833 


3206 


SMINT20031280 


1237 


542.. 1489 


3207 


SMINT20035050 


1238 


227.. 21 49 


3208 


SMINT20035510 


1239 


168..>1891 


3209 


SMINT20036440 


1240 


207.. 2084 


3210 


SMINT20038660 


1241 


289.. 1449 


3211 


SMINT20039050 


1242 


175..>2338 


3212 
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Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


SMINT20043390 


1243" 


1465..2094 


3213 


SMINT20044140 


1244 


30.. 1718 


3214 


SMINT20044730 


1245 


45.. 1787 


3215 


SMINT20045470 


1246 


1156. .1539 


3216 


SMINT20045830 


1247 


287.. 1882 


3217 


SMINT20045890 


1248 


893.. 1372 


3218 


SMINT20047290 


1249 


2. .637 


3219 


SMINT20048720 


1250 


1834,.>2374 


3220 


SMINT20049920 


1251 


1384.. 1860 


3221 


SMINT20052130 


1252 


271. .891 


3222 


SMINT20054050 


1253 


361.. 1839 


3223 


SMINT20056230 


1254 


73.. 1674 


3224 


SMINT20056240 


1255 


1108. .1533 


3225 


SMINT20062050 


1256 


75.. 1964 


3226 


SMINT20067080 


1257 


138.. 2075 


3227 


SMINT20070620 


1258 


44.. 445 


3228 


SMINT20074330 


1259 


71.. 532 


3229 


SMINT20077920 


1260 


843 1166 

W"TW« ■ 1 1 Ww 


3230 


SMINT20077960 


1261 


1010 2467 


3231 


SMINT20081330 


1262 


250. .61 8 


3232 


SMINT20083290 


1263 


80. .1606 


3233 


SMINT20084910 


1264 


664.. 987 


3234 


SMINT20085310 

V— ' 1 V 1 1 1 H | bWWWWW 1 W 


1265 


129 500 


^?^5 


SMINT20085450 


1266 


66.. 437 


3236 


SMINT20086250 


1267 


1958 2287 

i juu . .ecu § 


3237 


SMINT20086720 


1268 


233 1003 


3238 


SMINT20088440 


1269 


31. .438 


3239 


SMINT20088690 

1 V II 1 T| 1 ^.WwWWWww 


1270 


26.. 580 


3240 


SMINT20089210 


1271 


1259 1672 


3241 


SMINT20089600 


1272 


241 1335 


3242 


SMINT20091190 


1273 


290.. 1765 


3243 


SMINT20092120 


1274 


33.. 41 6 


3244 


SMINT20092160 


1275 


439.. 780 


3245 


SMINT20093630 


1276 


1822.. 2241 


3246 


SMINT20094150 


1277 


511. .966 


3247 


SMINT20094680 


1278 


36.. 452 


3248 


SPLEN20005160 


1279 


2342.. 2659 


3249 


SPLEN20005370 


1280 


1497 1856 

1 i \S t i » 1 Www 


3250 


SPLEN20006950 


1281 


702.. 1343 


3251 


SPLEN20011350 


1282 


1127. .1537 


3252 


SPLEN20012450 


1283 


134.. 442 


3253 


SPLEN20015030 


1284 


376..945 


3254 


SPLEN20015100 


1285 


1367..2137 


3255 


SPLEN20016500 


1286 


328..816 


3256 


SPLEN20017610 


1287 


940.. 131 4 


3257 


SPLEN20017810 


1288 


101. .556 


3258 


SPLEN20019120 


1289 


176..487 


3259 


SPLEN20020530 


1290 


358.. 762 


3260 


SPLEN20023430 


1291 


1067.. 1426 


3261 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEO ID NO of amino acid 




sequence 




sequence 


SPLEN20023540 


1292 


644.. 1903 


3262 


SPLEN20023850 


1293 


467.. >1 879 


3263 


SPLEN20024190 


1294 


31 9.. 1368 


3264 


SPLEN20024510 


1295 


1070.. 1423 


3265 


SPLEN20024620 


1296 


3.. 1514 


3266 


SPLEN20024770 


1297 


134.. 2353 


3267 


SPLEN20024930 


1298 


1150..3069 


3268 


SPLEN20029170 


1299 


67.. 750 


3269 


SPLEN20036780 


1300 


1432. .1752 


3270 


SPLEN20039180 


1301 


359.. 1534 


3271 


SPLEN20040780 


1302 


1675 2346 


3272 


SPLEN20041810 


1303 


801. .1142 


3273 


SPLEN20042200 


1304 


1123 1935 


3274 


SPLEN20043430 


1305 


224.. 565 


3275 


SPLEN20043460 


1306 


55.. 1899 


3276 


SPLEN20043680 


1307 


1337 2371 

1 WW f . .C\J 1 1 




SPLEN20045550 


1308 


5. .1513 


^97R 


SPLEN20048800 


1309 


2038 2352 




SPLEN20049840 


1310 


82 3378 

U L— . .WW 1 w 




SPLEN20050090 


1311 


41 8. .2202 


OllO 1 


SPLEN20051420 


1312 


685 1983 


3282 


SPLEN20054160 


1313 


186 2069 




SPLEN20054500 


1314 


667.. 1443 


OtOt 


SPLEN20055600 


1315 


146 1033 


3285 


SPLEN20057830 


1316 


507 1163 

<Jv /..II UU 




SPLEN20057900 


1317 


1883 2431 

1 WWW. . tT\J 1 


OcLO/ 


SPLEN20058180 


1318 


803.. 11 20 


3288 


SPLEN20059270 


1319 


16? 1586 

1 \J£L . . 1 JUL) 




SPLEN20062830 


1320 


282.. 785 




SPLEN20063250 


1321 


315. .1535 


3291 


SPLEN20063890 


1322 


133 996 

1 WW. . vj v7 U 


w^£7£. 


SPLEN20067010 


1323 


2702 >3023 




SPLEN20071820 


1324 


876.. 1592 


3294 


SPLEN20073500 


1325 


174. .2210 


3295 


SPLEN20073880 


1326 


220 2595 


3296 


SPLEN20076190 


1327 


20. .331 


3297 


SPLEN20076470 


1328 


575. .1714 




SPLEN20080070 


1329 


5.. 1339 


3299 


SPLEN20081640 


1330 


946.. 1308 


3300 


SPLEN20085910 


1331 


1084.. 1434 


3301 


SPLEN20087370 


1332 


2670 3098 


3302 


SPLEN20087860 


1333 


255.755 


3303 


SPLEN20090880 


1334 


1742..2194 


3304 


SPLEN20098030 


1335 


268.. 1077 


3305 


SPLEN201 00040 


1336 


101.. 862 


3306 


SPLEN20101950 


1337 


122.. 1198 


3307 


SPLEN20104150 


1338 


1146..2219 


3308 


SPLEN201 04690 


1339 


661. .990 


3309 


SPLEN20105100 


1340 


1425.. 1859 


3310 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO. of amino acid 




sequence 




sequence 


SPLEN201 08000 


1341 


887.. 1351 


3311 


SPLEN201 08460 


1342 


2254.. 2607 


3312 


SPLEN20110180 


1343 


147.. 1292 


3313 


SPLEN201 10210 


1344 


1831.. 2271 


3314 


SPLEN201 10860 


1345 


1489.. 1887 


3315 


SPLEN20111450 


1346 


2768.. 3208 


3316 


SPLEN20114190 


1347 


154.. 1440 


3317 


SPLEN201 16720 


1348 


9. .1583 


3318 


SPLEN201 17580 


1349 


218.. 1240 


3319 


SPLEN201 18050 


1350 


145.. 11 22 


3320 


SPLEN20121790 


1351 


569.. 1054 


3321 


SPLEN201 25230 


1352 


971.. 1324 


3322 


SPLEN20126110 


1353 


254. .991 


3323 


SPLEN20 135030 


1354 


40.. 1848 


3324 


SPLEN201 36700 


1355 


1296.. 1628 


3325 


SPLEN201 36730 


1356 


1312. .1644 


3326 


SPLEN201 37530 


1357 


2676.. 3221 


3327 


SPLEN201 38600 


1358 


1166, .1582 


3328 


SPLEN20139100 


1359 


63.. 1532 


3329 


SPLEN201 39360 


1360 


763.. 1827 


3330 


SPLEN201 75920 


1361 


1123. .1590 


3331 


SPLEN20176130 


1362 


16.. 549 


3332 


SPLEN201 77400 


1363 


1078. .1431 


3333 


SPLEN201 80980 


1364 


410. .823 


3334 


SPLEN20181570 


1365 


214 >2609 


3335 


SPLEN201 82850 


1366 


168.. 659 


3336 


SPLEN201 82990 


1367 


55.. 1881 


3337 


SPLEN201 83020 


1368 


265.. 723 


3338 


SPLEN201 83950 


1369 


402.. 746 


3339 


SPLEN201 87490 


1370 


98..>2458 


3340 


SPLEN201 90080 


1371 


914.. 1342 


3341 


SPLEN201 90430 


1372 


884.. > 1942 


3342 


SPLEN201 90770 


1373 


97.. 771 


3343 


SPLEN20191020 


1374 


140.. 1426 


3344 


SPLEN201 92570 


1375 


325.. 654 


3345 


SPLEN201 93230 


1376 


253.. 588 


3346 


SPLEN201 93490 


1377 


321.. 854 


3347 


SPLEN201 93750 


1378 


845.. 1306 


3348 


SPLEN20193790 


1379 


290.. 2278 


3349 


SPLEN201 95710 


1380 


64..516 


3350 


SPLEN201 97090 


1381 


11 ..550 


3351 


SPLEN201 97740 


1382 


2.. 499 


3352 


SPLEN201 97930 


1383 


803..>1625 


3353 


SPLEN201 98390 


1384 


14.. 1873 


3354 


SPLEN201 99850 


1385 


575.. 1009 


. 3355 


SPLEN20200070 


1386 


353.. 655 


3356 


SPLEN20200340 


1387 


1047.. 1814 


3357 


SPLEN20201830 


1388 


143.. 1177 


3358 


SPLEN20203590 


1389 


160..471 


3359 
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Table 1 (continued) 



Clonp namf* 

VIUIIC Mull 


SFO ID NO of nucleotide 


Position of 


opf") in WO t>f amino arid 

O IU IVU. Ul dillillw CtLflU 




sequence 




sequence 


SPLEN20204670 


1390 


6.. 311 


11R0 


SPLEN20205120 


1391 


682 1542 


3361 


TES0P1 0000350 


1392 


1646 2101 


11RP 


TES0P1 0001 600 


1393 


282.. 1352 


3363 


TESTI1 00001 90 


1394 


84.. 2042 


11 R4 

OOOH 


TEST1 10000850 


1395 


QP5 1311 

OtJi .lull 


11R5 
oo oo 


TESTI1 0001 570 


1396 


141 ..1919 


11RR 
oooo 


TESTI2000431 0 


1397 


55 2484 


11R7 
ooo/ 


TESTI20005980 


1398 


1 OJv, . I QUO 


IIRft 
oooo 


TESTI2G006160 


1399 


51 17P0 


11RQ 

0009 


TESTIPG006810 


1400 


OHO 171Q 


QQ70 
OO / u 


TESTI20012080 


1401 


QQ 1 Qfi7 


1171 ! 

OO/ I 


TEST120012360 


1402 


17fl 5Q4 


117P 


TESTI20016970 


1403 


133. .1842 


1171 
oo / o 


TESTI2001Q5Q0 

1 I— W 1 1 ZJ >~J \J \J 


1404 


P10 75P 


1174 

OO/H 


TF^TIPOOPROPO 


1405 


P01 1PQR 


QQ"7K 
OO/O 


tfstipoopqioo 


140R 


1741 P147 


007R 
OO/O 


tfstipooiopoo 


1407 


Q4 P0Q1 
,*iuy i 


QQ7"7 
OO / / 


tfstipooio44o 




RO 1Q11 

ou. . i yo i 


QQ.7Q 
OO/O 


TFSTIP0010R10 

1 CO 1 ItUUOUU 1 V 


140Q 


547 1171 
OH/ .. IO/ l 


QQ7Q 

oo/ y 


TFSTIP0011110 


1410 


flflQ PP15 


OO OU 


TFSTIP0011410 


1411 


P01 1R55 


OOO 1 


tf^tipooip77o 

i co i i*lvjl>o*i/ / yj 


141 P 


irq iono 

1 DO. . 1 uu*c 


QQQO 
O0O£ 


tfstipooi4750 


1411 

I *f 1 o 


47P ftOA 


OOOO 


tfstipooi5ho 


1414 

I H I H 


1 Q 55ft 

I o. .ooo 


OOD4 


TFSTIP00157QO 

1 CO 1 IcUUOJ/ 


1415 
1 *f 1 O 


1R P1RQ 
I □. .c. I DO 


OOOO 


TFSTIP001RP40 


141R 


1 R1 PROP 

1 DO. 


OOOO 


TF9TIP0040A50 


141 7 




QQQ7 

OOO/ 


TF9TIP0041 rio 

1 CO 1 ItUUf 1 QJU 


141ft 
I *+ I o 


Qin iror 

OOU.. IO£0 


OQQQ 
OOOO 


TF < mpoo4ii io 

1 CO 1 I^UUtO 1 JU 


141 Q 


410 715 


OQQQ 

oo oy 


tf^tipoo4h fto 

I CO I I^UUHO I ou 


14P0 


Rft OR1R 
00. .£3 IO 


oo y(J 


TF < VTIP0041PPO 

I CO 1 ItUUtOttU 


14P1 


14QR 1Q57 

1 HyO. . 1 / 


QQQ1 

ooy i 


TFSTIP0041Q10 


14PP 


17 1 7Pft 

O/ .. I / 


ooy<£ 


TESTIP0041QQ0 

1 CO 1 \£.\J\J'-r\J<j<y\J 


14P1 


111 1151 

O 1 1 . . 1 1 oo 


ooyo 


TFSTIP0044Q00 

1 CO 1 ItUUttOUU 


14P4 


P14 15Q5 


ooy*t 


TFSTIP00451QO 


14P5 


15R P10ft 

I DO. . c. 1 UO 


ooyo 


TFSTIP0045740 


14PR 


1 QQ 54Q 

1 yy. .Of 27 


QQQ« 

ooyo 


TESTI20046110 


14P7 


P4 1 RPfi 


11Q7 
ooy/ 


TESTI20046490 


1428 


554 P54P 


IIQfl 
ooyo 


TFSTIP0046540 


14PQ 


1 PI P10Q 


Q.QQQ 

ooy y . 


TFSTIP004R870 


1410 


P7fl 1 R45 

£/ O. . I OHO 


1400 
0*fUU 


TESTI20046890 


1431 


276.. 2702 


3401 


TESTI20047370 


1432 


2.. 21 34 


3402 


TESTI20047930 


1433 


239.. 1843 


3403 


TESTI20049060 


1434 


1191. .1784 


3404 


TESTI20049410 


1435 


281. .21 70 


3405 


TESTI20049990 


1436 


355..915 


3406 


TESTI20050170 


1437 


314..808 


3407 


TESTI20050400 


1438 


111. .440 


3408 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



55 



L/ioric ndrne 


OCA \r\ K\ f\ n>f m m\ar\i\rtci 
olW IU l\U. Of nUOIcULIUU 


Pncitinn nf 

PUoltlUM Ul ouo 


OCVjJ IU INVJ. Ul CtllllllU dUlU 








seauence 


TF^Ti9nn^n79n 


1 noy 


fifl Tft91 

□ O. . 1 \JC- \ 


0*tUO 


1 CO 1 l£UUu IcUU 


1 AAn 

I HH u 


1 3D 1 . .C \ H*J 


^Am 

OH 1 U 


1 CO i IHUUO 1 f Ov 


1 A41 


1QQ H QAQ 


^AH 

OH 1 1 


tf qti onn K9C70. 


"1 AAO 
1 HH£ 


/ OS., la IU 


O.A1 O 
Oh i 


TF^TionnKo, nvn 

i Co i i<:uuoou/u 


1 AA^ 
1 HHO 


QQ IOOC 


^A1 ^ 

OH I O 


TFQTionnK^ocn. 

1 CO 1 I^UUOOjlOU 


1 AAA 
1 444 




O.A1 A 
04 I H 


TF^Ti°nnco7p,n 

1 CO 1 I^UUOO / Ov 


1 AAC. 
I HH O 


1 7fi 1 QAR 


OH I O 


1 CO 1 l£UUOOOUU 


1 AA.fi 
I HHO 


A9 1 fiQA 


^A1 fi 
OH I O 


1 CO 1 IcUUjJSjU 


1 AA7 
i hh/ 


coo ^A7H0 


^A1 7 
OH 1 / 


1 CO 1 l£UUOH /uu 


1 AAA 
l HH O 


lb.. I SOD 


^A1 Q 
OH I O 


1 Co i l^UUOOOoU 


1 AAQ 


OJ.. to 1 u 


QAi Q 
Oh l y 


TFQTIOnnKKQQO. 
1 CO 1 t2UUDbOOU 


1 /en 
I 4bU 


\d.t ..1 044 


042U 


TFQTion.nee.non. 

1 CO 1 I2UUD0U0U 


1 AK1 
1 4D 1 


C/1Q 1 CCD 

o4o.. 1 boo 


04^1 


tf QTi9nnK7onn 

1 CO 1 l£UUO/£UU 


1 A CO 


^/y..o 1 1 


QAOO 

04^:^ 


tf QTi9nne.7A<ifi 

1 Co I I^UUO/HOU 


i A CO 


1 01 . . 1 oDO 


04^0 


TCQTi9nne7con 
i to i I2uuo/oyu 


1 /I C/1 

14b4 


0/ .. 1 Ub^ 


1A O/l 

04^4 


tc CTion.nc7Q/i n 
1 to 1 I2uuo/o4u 


i /ICC 


1 oU. .1*1/1 


0/ nc 

042O 


1 co 1 l2uub/ooU 


i /ICC 
1 4O0 


on iftQ7 


042b 


TCQTIOnflKQQKn 
1 CO 1 l2UUOoOOU 


140/ 




O/l 07 

042/ 


i to i i2uuboy2U 


i A KQ 
1 400 


1 Uo.. 1 Uoo 


O/IOQ 

042 0 


TFCTionnKonon 

i to i i2uuoyuou 


i /ICO 

1 4oy 


^lo4. .oUby 


042y 


"TC oti o nn c 00 0 n 
1 co 1 l2Uuby 00U 


h >i en 
1 4bU 


b4U..o4o 


oa on 
O40u 


1 co 1 l2uuQyo/U 


i4bl 


bob. .110/ 


O/l OH 

3431 


T"CGTionncn/i oft 
1 co 1 12uUoy4oU 


H >1 CO 

1 4b2 


CO H OCO 


3432 


1 co 1 I2uuby/yu 


H >1 CO 

i4b3 


2Uo.. 100b 


O/l OO 

o4oo 


TCCTionncnoH n 

1 to 1 i2uubyo iu 


•i AG.A 

1404 


0^4. .> iyi4 


O/l O/l 

0404 


TC CTIOnnennQfl 
I CO 1 l2UUbUUoU 


i4bb 


i 70 QCQ 


O/l oc 

040O 


1 co 1 l2UubU lbU 


14bb 


OQO -1 CCO 

000..1 bbo 


O/l OO 

o4ob 


1 to 1 120060oo0 


H AC~7 

i4o/ 


OH H A 07 

01 ..142/ 


O/l 07 

343 7 


1 to 1 l2UUbU4bU 


H /ICQ 

1 4bo 


■j one H 7KO 
1 oUb.. 1 /o2 


O/l OQ 

040o 


1 to 1 l2uUbUoou 


H X CO 

1 4by 


CA OCA'S 

b4..2b4o 


0 a on 
o4o9 


1 co 1 i2uuoi uyu 


1 4/ U 


OQO 7QQ 


O/l /in. 
044U 


TFGTIOnncH onft 
1 CO 1 l2UUbl 2UU 


H A7i 
1 4/1 


OQO 1QQQ 

oo£... 1 yoo 


044 1 


1 to 1 I2UU02 i2U 


H A "70 
1 4/2 


07 H COO 

J/.. lbo2 


O/l /1 0 

3442 


TFcrionneoi on 

1 CO 1 I2UU021 ou 


H. /170 


4 CQ i/HO 

1 by., i *n 0 


O/MO 
0440 


1 to 1 l2UUb2boU 


14/4 


000..I ibo 


O/l /I A 
0444 


TFQTiofin.cooon 
l to 1 l2UUbo3oU 


-\ /17C 


1 004.. I O/O 


OA /IK 
044O 


TFCTionncoAi n 

1 to 1 I2UU004 1 U 


H /47C 

1 4/0 




OAAC 
0440 


TFQTionnft^ftnn 

1 CO 1 I^UUDJOUU 


i A77 
1 H/ / 


1 *^QQ 1QflQ 

i oyy.. i ouy 


^AA7 
OHH/ 


TF CTI0nfiCA070 
1 CO 1 I^UU040/U 


H A7Q 
1 4/0 


oni i 7HC 


OAAQ 
O44o 


TFQTIOO.O.C.ACOO. 
1 to 1 I^UU04DOU 


i A7Q 

1 4/y 


cso 0707 

OO. .C.I Ol 


OAAQ 
044y 


TESTI20064650 


1480 


353..2542 


3450 


TESTI20064990 


1481 


733.. 1290 


3451 


TESTI20065650 


1482 


374.. 1609 


3452 


TESTI20066150 


1483 


98.. 2473 


3453 


TESTI20066170 


1484 


143.. 1309 


3454 


TESTI20066280 


1485 


100..879 


3455 


TESTI20066330 


1486 


26.. 1528 


3456 


TESTI20066590 


1487 


306.. 740 


3457 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO in NO nf amino arid 




sequence 




sequence 


TESTI20066656 


1488 


415. .1362 


3458 


TESTI20067350 


1489 


342.. 1073 


3459 


TESTI20067440 


1490 


196 1887 

1 WW. ■ I i 


3460 


TESTI20067480 


1491 


127.. 1758 


14R1 


TESTI20068530 


1492 


791 1369 


1489 


TESTI20068790 


1493 


1033 >1738 


1481 


TESTI20068940 


1494 


55 >1867 


1484 


TESTI20070400 


1495 


147 2300 


1485 


TESTI20070740 


1496 


743 1480 


3466 


TESTI20071130 


1497 


224.. 1738 


1487 


TESTI20071630 


1498 


38.. 727 


14RR 


TESTI20073460 


1499 


97.. 1584 


14RQ 


TESTI20075240 


1500 


959 1RSR 


1470 


TESTI20076570 


1501 

1 JU 1 


«pp 111 n 

\JC-C-. , I I I U 


1471 


TESTI20076920 

1 U V 1 1 £— W t \J w L.W 


1502 


on fiOft 


1479 


TESTI20079060 

1 ^ 1 l^wwf www/w 


1503 


105 POQfi 


1471 


TESTI2007Q290 


1504 


54 9959 


1474 


TESTI2G079980 


1505 

1 


GO 1 QftO 


1475 


TESTI20080460 


150R 


1458 1787 


147R 


TESTI20081890 


1507 


184 9590 


1477 


TESTI20083890 

l I l^wwwwUww 


1508 I 


11 OP 15Qfi 


1478 
OH / o 


TFSTI20084950 


150Q 

I OVJC7 


OOO..cUJO 




TEST190085R70 


1510 


145Q 9001 


OH-OU 


TFSTI9008R840 


1511 

loll 


10 1 488 


14ft 1 


TFST190088840 


1 519 


1 79 1 587 


14fl9 


TFSTI2008Q9QO 


151 1 


98 491 


0*4-00 


TESTI20090180 

1 i — 1 1 l^vv w w | Uw 


1514 


597 1fi57 


1484 


TFSTI2G090970 


1515 i 


548 9108 


14ft 5 


TFSTI200911RO 


151 ft 


74.5 1 910 




TESTI2G092170 


1517 


889 195R 


1487 


TFSTI200Q3QOO 


1518 


985 1701 


14flft 


TESTI20094620 


1519 


18Q7 911 Q 

1 Ov7 / . .^O 1 t> 


148 Q 


TESTI20095200 


1520 


94.. 11 49 


14Q0 


TESTI2G095440 


1591 


1114 14Q8 

1 1 0*+.. 1 HOO 


14Q1 


TESTI20095770 


1599 


978 -b.9995 


14Q9 


TESTI200Q5880 


1523 


144 1775 


14Q1 


TESTI20097270 


1524 


157 RQ1 


14Q4 


TESTI20099350 

I l— w 1 l^wwwwwww 


1525 


191 !>1771 


14Q5 

O*+x70 


TESTI201 00090 


1526 


1020 2057 


3496 


TESTI201 02390 


1527 


1873 2947 

IO/ J. . t—<—^ 1 


14Q7 


TESTI201 03690 


1528 


71 ..1048 


14Q8 


TESTI201 04090 


1529 


364.. 1716 


3499 


TESTI20105130 


1530 


320.. 1966 


3500 


TESTI201 05910 


1531 


151.. 2265 


3501 


TESTI20106170 


1532 


235.. 1338 


3502 


TESTI201 06820 


1533 


1191 ..1604 


3503 


TESTI201 07240 


1534 


163.. 2844 


3504 


TESTI201 07320 


1535 


377.. 787 


3505 


TESTI201 07340 


1536 


1229.. 1723 


3506 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO of amino acid 




sequence 




sequence 


TESTI201 08060 


1537 


371.. 925 


3507 


TESTI201 12540 


1538 


148.. 1554 


3508 


TESTI201 12860 


1539 


285.. 1829 


3509 


TESTI20113150 


1540 


374.. 1042 


3510 


TESTI201 13940 


1541 


1185. .1598 


3511 


TESTI201 14480 


1542 


161 2032 


3512 


TESTI201 16050 


1543 


218 1756 


3513 


TESTI20116120 


1544 


1 7. . 1 276 


3514 


TESTI201 17500 


1545 


140. .508 


3515 


TESTI201 18460 


1546 


69 1580 


3516 


TESTI201 20500 


1547 


135 926 


3517 


TESTI201 20900 


1548 


60 665 


3518 


TESTI20121040 


1549 


8. .1525 


351 Q 

OO 1 <3 


TESTI20121710 


1550 


278.. 1543 


3590 


TESTI201 22070 


1551 


145 1359 

i "w. ■ 1 www 


3521 


TESTI201 22440 


1552 


25.. 465 




TESTI201 24440 


1553 


46Q 801 


3593 


TESTI201 25280 


1554 


371 1363 


3594 


TESTI201 25440 


1555 


342 671 


3595 


TESTI201 25920 

1 I — ' 1 l£»w 1 bwwbv 


1556 


31 1680 
win i uuy 


3596 


TESTI201 26280 


1557 


16Q 2562 


3597 


TESTI201 30530 


1558 


OQQ 1 RQp 
£30. , 1 U\7£, 


OOcO 


TESTI201 31 440 




tCtCO.. 1 £.1 O 


■359Q 


TESTI201 32310 


1560 


004 713 


**5qo 
ooou 


TESTI201 32680 

1 l— *J 1 1 Cm \J 1 \JCKJkJ\J 


1 561 


1011 144ft 


^5^1 
oOo 1 


TEST! 20 134010 


1 562 


334 14R4 




TESTI201 34270 


1563 


59. .601 


3533 


TESTI201 34680 


1564 


324 2726 


3534 


TESTI201 34970 


1565 


1Q7 1873 

13/ .. 1 O / O 


3535 


TESTI201 36010 


1566 


1*5? >1415 


353fi 


TESTI201 40970 


1567 


376 801 


3537 


TESTI20 142480 


1568 


1466 1Q3Q 


353R 

OJOO 


TESTI20 142540 


1569 


1231 15Q3 


3539 


TESTI20143180 


1570 


201 1 Q4fi 


3540 


TEST120 144390 


1571 


776.. 1552 


3541 


TESTI201 45780 


1572 


1315 1746 


3542 


TESTI201 48380 


1573 


32 2680 


3543 


TESTI201 49880 


1574 


581 982 


3544 


TESTI201 50420 


1575 


221. .1162 


3545 


TESTI201 50920 


1576 


142. .1713 


3546 


TESTI20151050 


1577 


1290 1736 

1 c w w . . | / wW 


3547 


TESTI20151800 


1578 


671. .1099 


3548 


TESTI201 52490 


1579 


717.. 1226 


3549 


TESTI201 53310 


1580 


151. .1317 


3550 


TESTI201 54370 


1581 


192. .1544 


3551 


TESTI201 59380 


1582 


852.. 1679 


3552 


TESTI20161010 


1583 


33..539 


3553 


TESTI201 62780 


1584 


607.. 1299 


3554 


TESTI201 62980 


1585 


224.. 141 4 


3555 
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Table 1 (continued) 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO ID NO of amino arid 




sequence 




sequence 


TESTI201 64210 


1586 


526.. 1593 


3556 


TESTI201 65680 


1587 


154.. 552 


3557 


TESTI201 65990 


1588 


201. .992 


3558 


TESTI201 66290 


1589 


351.. 2786 


3559 


TESTI201 66670 


1590 


754 1938 


3560 


TESTI20 167580 


1591 


1152 >1543 


3561 


TESTI201 68880 


1592 


201.. 524 


3562 


TESTI201 69500 


1593 


280. .1674 


3563 


TESTI20170170 


1594 


199.. 1530 


3564 


TESTI201 70280 


1595 


1..696 


3565 


TESTI201 70690 


1596 


61.. 474 


3553 


TESTI201 70890 


1597 


882 1238 


3567 


TESTI20171070 


1598 


1088 1501 


3568 


TESTI20 173050 


1599 


724.. 1218 


3569 


TESTI20173110 


1600 


1194 1586 


3570 


TESTI201 73960 


1601 


376 1764 


3571 

Oj/ 1 


TESTI20 175370 


1602 


RPft 1 794 


3579 


TEST1201 76450 


1603 


1 ..996 


QK7Q 

ou / o 


TESTI201 79230 


1604 


162 170Q 


3574 


TESTI201 79510 


1605 


1702 !>204? 


3575 
00/ 0 


TESTI201 80600 


1606 


1 62.. 857 


357fi 


TESTI20 182210 


1607 


736 1 1 HP 

/ uU ..II Ol 


^77 

00/ / 


TESTI201 82760 


1608 


ofifi l3Qn 


3573 


TESTI201 83680 


1609 


773 >1QP7 


357Q 


TESTI201 84280 


1610 


->173p 


3530. 


TESTI20 184750 


1611 


fiQQ PI 1 R 

0\3\j . . c, 1 ID 


3531 


TESTI201 84760 


1612 


161 ..1741 


3539 


TESTI 20 184820 


1613 


1RQ 7PP 


0000 


TESTI20186110 


1614 


PP1 Ifififi 


^fl4 


TESTI201 92570 


1615 


109 687 


3535 
0000 


TESTI201 93080 


1616 


149.. 21 94 


3533 


TESTI201 93520 


1617 


311 1375 


3537 
000/ 


TESTI201 94880 


1618 


616 1554 


3533 


TESTI20 196690 


1619 


271 ..945 


353Q 

0909 


TESTI201 96970 


1620 


78Q 1565 


OJ9U 


TESTI201 97030 


1621 


263. .1717 


35Q1 


TESTI201 97290 


1622 


643 1314 


35QP 


TESTI201 97600 


1623 


100 QOQ 


35Q3 


TESTI201 98540 


1624 


21 ..1553 


3594 


TESTI201 98600 


1625 


909 1328 


35Q5 


TESTI20199110 


1626 


622..1617 


35Q3 


TESTI201 99980 


1627 


764.. 11 02 


3597 


TESTI20200120 


1628 


661. .966 


3598 


TESTI20200840 


1629 


378..818 


3599 


TESTI20201760 


1630 


22.. 603 


3600 


TESTI20202830 


1631 


631.. 972 


3601 


TESTI20204260 


1632 


71 4.. 1406 


3602 


TESTI20205100 


1633 


270.. 1763 


3603 


TESTI20205150 


1634 


1286.. 1663 


3604 
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Table 1 (continued) 



Clone name 


SEQ ID NO of nucleotide 


Position of CDS 


SFO ID NO of amino ariri 

OL\j< \\J llU, Wl dllllMO UOl VJ 




sequence 




sequence 


TESTI20205250 


1635 


32.. 1888 


3605 


TESTI20207170 


1636 


1..795 


3606 


TESTI20209050 


1637 


269.. 1900 


3607 


TESTI20210030 


1638 


328. .675 


3608 


TESTI20210570 


1639 


280 1362 


3609 


TESTI20211380 


1640 


372.. 848 


3610 


TESTI20212970 


1641 


93. .21 02 


3611 


TESTI20214630 


1642 


690 1106 


3612 


TESTI20215310 


1643 


294 899 


3613 


TESTI20219110 


1644 


982 1344 


1R14 

OO I *T 


TESTI20219390 


1645 


253 573 


1R15 

OO I o 


TESTI20220230 


1646 


116. .967 


3616 


TESTI20221 790 


1647 


109 1830 


1R17 

OO 1 / 


TESTI20222030 


1648 


277.. 2277 


1R1S 

OO 1 o 


TESTI20222460 


1649 


162 1889 


1R1Q 

OO 1 57 


TESTI20223380 


1650 


322.. 831 


1R90 


TESTI 20226520 


1 651 


178 11Q2 


1R91 


TESTI20227380 


1652 


20.. 1747 


1R99 


TESTI20228120 


1RR1 


91 8R8 


1R91 


TESTI20228740 


1654 


R07 102R 


1R94 


TESTI20244220 


1655 


757 2448 


1R95 


TESTI20244430 


1 656 


17 1 A Ad 


1R9« 


TESTI20244460 


1 657 


«Q4 leu 

OOt, .ION 


1R97 


TESTI20244730 


1658 


127 1044 


1R9ft 


TESTI20245600 


1 RRQ 

I UOC7 






TESTI20245860 


1660 


R84 1 058 


ifiin 
ooou 


TESTI20246410 


1661 


1511 1818 

1 J 1 J.. IO I o 


1R11 


TESTI20246480 


1 RR2 


QRO 11QQ 


1R19 


TESTI20247440 


1 663 


t ttJ..aU/ 


1R11 


TESTI20248850 


1 664 


3..1 829 




TESTI20249360 


1 665 


roo 15R5 


1R15 
OOOO 


TESTI20250220 


1666 


63 2015 


1R1R 

OOOO 


TESTI20250630 


1667 


482 16Q0 


1R17 
ooo / 


TESTI20251 440 

1 1 — *~t 1 \C\J C*J 1 1 IV 


1668 


42 1271 

H-^ . - lc/ I 


OOOO 


TESTI20251610 


1 669 


410 1218 


1R1Q 


TESTI20251 740 


1670 


710 1104 

/ JU ..II ot 


1R40 
OOfU 


TESTt20252690 


1671 


1182 2118 


1RA1 


TESTI20254030 


1672 


363 1958 


1R49 


TESTI20254090 


1673 


878 1360 


1R41 

OOtO 


TESTI20254480 


1674 


320 13Q0 


1R44 


TESTI20254990 


1 675 


6Q3 17QQ 


1R45 
oo*+o 


TESTI20255460 


1676 


152..2710 


3646 


TESTI20256560 


1677 


169..>2257 


3647 


TESTI20257910 


1678 


457.. 1311 


3648 


TESTI20258720 


1679 


38.. 1630 


3649 


TESTI20259110 


1680 


486.. 1805 


3650 


TESTI20259200 


1681 


227.. 1369 


3651 


TESTI20260140 


1682 


1113..1421 


3652 


TESTI20260640 


1683 


6.. 1109 


3653 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SFO in NO nf flminn aHH 

OlU 1 VJ VAKJ. Ul CUIIIIIU dUlll 




sequence 




sequence 


TESTI20261040 


1684 


14.. 970 


3654 


TESTI20261160 


1685 


257. .868 


3655 


TESTI20261680 


1686 


1422 1901 


3656 


TESTI20262150 


1687 


315 2765 


1RR7 


TESTI 202 62940 


1688 


255 1*574 


ODOO 


TESTI20264530 


1689 


150 1580 


OU93 


TESTI20264910 


1690 


1638 2378 

1 UUU . . L. w / U 


1RR0 


TESTI20265150 


1691 


1023 140Q 


1RR1 
000 1 


TESTI20265340 


1692 


965 1552 


1RR9 


TESTI20265890 


1693 


QQ4 110ft 


ODOO 


TESTI20266050 


1694 


116. .1522 


1RR4 


TESTI20268240 


1695 


70 1R1ft 


ODOO 


TESTI20269250 


1 696 


17 -^1Q9R 


OOOO 


TESTI20269360 


1697 


fifiP 100R 


1RR7 
ODD/ 


TESTI20270130 


1 698 


99. .512 


IRRft 
OOOO 


TESTI20271790 


1 RQQ 


101 1R99 


OODs 


TESTI20272380 


1 700 
1 # w 


191 4Q9 


00/ u 


TESTI 202 74960 


1701 


R1Q 171R 

U 13.. I / ID 


QR71 
JO/ 1 


TESTI 20277300 


1 702 


9R1 1R00 
tOO.. I ouu 


OO /£ 


TESTI20278280 


1 703 


ftO R07 


OO/O 


TFSTI20282420 


1 704 


CO COO 
sjc.. .*JC.O 


OD/*+ 


TFSTI20P89R10 


1 70R 


oyy.. i goo 


QC7C 


TFSTI20282QOO 


1 70R 


in 141 

OU. .OH 1 


OO/O 


TESTI 20284260 

1 l — » 1 Il.ul.Ut^l.UU 


1 707 


9Q 4Q1 


QR77 
OD/ / 


TESTI20285210 


1 70ft 


1710 91RR 


OO/O 


TFSTI909Rfi^Q0 


1 70Q 


R1Q 1191 


oo/y 


TESTI20287760 


1 710 


/ / O. ,cDof 


OOOU 


THYMU 10004280 


1 711 


R17 Qftft 
0 1 / . .900 


OOO 1 


THYMU2Q00R020 

1 1 1 1 IrlUCVUUUvLv 


1 719 

1 / II 


R0R 1*387 
QUO. . I 00/ 




THYMU20007020 

1 1 1 1 IVIVJL.UUU / \JC-\i 


1711 

1 / IO 


17CC 9111 

I / sJO. ,l ( I I 


ODOO 


THYMU20007750 


1 714 


1077 14QQ 




THYMU20008000 

1111 IVIUL.VJUUUUUU 


171S 


1R0R 10*^ 


ODOO 


THYMU20009460 


1716 


1314 1791 

1 JOH. . 1 / L.O 


OOOO 


THYMU20009500 


1717 


717 9117 


OOO/ 


THYMU2000Q71 0 


171ft 

1 f IO 


947 7R9 


OOOO 


THYMU20010180 

1 1 1 1 IVI IVI Uv 


1719 


1 Q1 R14 
1 1 . .00*+ 


ouoy 


THYMU20010710 

1 1 1 1 IVI WfaVv 1 V/ 1 V 


1 790 


ftcn 11RA 

OJU.. 1 i o*+ 


QRon 


THYMU200 12020 

fill IVIvLvw 1 C—\J^\J 


1 721 
1 1 c \ 


11R0 1RR1 


^RQ1 


THYMU200 12560 


1 722 


47 517 


^RQ9 


THYMU200 13250 

1 1 1 1 IVIw^W 1 U&JU 


1 791 

I / L.O 


111 110Q 


ODaO 


THYMU200 13810 

1 1 1 1 IIIV/CVV/ 1 \J\J 1 V/ 


1 794 


1QR Q97 

ISO.. <3£. 1 


000*+ 


THYMU20014430 


1725 


1280.. 1678 


3695 


THYMU20017270 


1726 


16..660 


3696 


THYMU20018250 


1727 


1205..2122 


3697 


THYMU20018390 


1728 


237.. 1373 


3698 


THYMU20019000 


1729 


405.. 758 


3699 


THYMU20019260 


1730 


31 7.. 736 


3700 


THYMU20020370 


1731 


1332.. 1733 


3701 


THYMU20020800 


1732 


600.. 1196 


3702 
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Table 1 (continued) 



filnnp name* 






otu iu rvivj. ot amino acio 




sequence 






THYMU20021090 


1733 


211 ..1281 


O/ UO 


THYMU20021540 


1734 


100 1R7R 


O/UH 


THYMU20023560 


1735 


1429 1884 


^70R 

Of UO 


THYMU20024500 


1736 


1QQ 1RR4 

1 \J\J . . 1 Jut 


o / uu 


THYMU20025480 


1737 


13RP 17R1 

1 JUL., 1 / J 1 


^707 


THYMU20026950 


1 738 


Q QPO 


^7ftfl 
O/ UO 


THYMU20028150 


1 73Q 


oqq iooo 


o/uy 


THYMUP00PR41 0 




10 10RR 

1 v., 1 vOD 


Q71 ft 
O/ I u 


THYMU20029830 


1741 


4R 4RR 


^71 1 
O/ I I 


THYMUP00304RO 


174P 

1 I Ht 


HDO.. 1 1 / D 


Q71 O 


THYMLJPOO'lOfiQO 


1 743 
l / *f o 


i CCO OOCQ 
1 ODD. .ddxyty 


071 Q 
Of IO 


THYMUP0031 330 


1 744 


QO RRQ 
yu..ooy 


0.71 A 
Of 1 H 


THYMUP003PRP0 


1 74R 

I /HO 


PRO 117R 
^OU.. II /D 


O/ IO 


THYMUP0034400 


1 74R 


7Q1 119Q 
/ y I .. I I 


Q71 A 
O/ 1 O 


THYMUP00347QO 

1 1 1 1 IVIUtvUO'T/ ?7U 


1 747 


1 PO ARO 


Q71 7 
O/ 1 / 


THYMI JP003RR00 
i n i iviu^uuououu 


1 74R 
I / HO 


00.. I DO/ 


O74 Q 
O/IO 


THYMI JP003Q3PO 
i n i iviU£UUOt70£U 


1 74Q 


CO/ ^ OQQ 


Q"71 Q 

0/ 1 y 


THYMUP0043440 

III! iVIUfcUUHvJ'T'+y 


1 7R0 


1^01 1R1R 
I OU I..IO io 


Of d\J 


TH YML JP0043RRO 


1 7R1 
I /o I 


OQO fiOi 


O/^l 


THYMI JP00441 00 


1 7RP 
I /Ot 


1 Qftft OOOQ 
1 0\J\J . 


0700 

of dc 


THYMI JP0044RP0 


1 7R*3 

I / JO 


1 QP7 1 QQQ 

i O0/..1 oyy 


0700 
0/^0 


THYMI JP004R3R0 


1 7RA 


1 OOO..^OOl 


0/^4 


THYMI IPftftAR77ft 


1 /DO 


o^i ..b/1 


070C 

3725 


thymi jpnndQORn 


1 7RR 
I /DO 


■f 70 7*50 


o/^b 


thymi iPftftRftftift 


1 7K7 

1 loi 


<lUO.. IOO 1 


0707 

O/z?/ 


THYML IP0DR1 3Aft 


1 / JO 


one h ono 


0700 


thymi jpoorp4ro 

P • 1 1 IVIU^UUO£H-UU 


1 7RQ 


I jUU.. 1 OOO 


o/^y 


thymi ipnftRpRQn 

i n T IV)Li£UUO£OOU 


1 7C.fi 

1 /bu 




3730 


thymi ipnftRziflftft 


1 7C1 


4/y..ooU 


070 4 
O/Ol 


THYML JP00RR4R0 


1 7RP 




O/O^ 


THYMI IPftftRRARft 


1 7CQ 
1 / DO 


<£U..OO 1 


O/OO 


THYMI IPftORR7Aft 


1 7flA 


yui .. I04 1 


o/o4 


THYMUP00RR7R0 


1 7RR 

I / DO 


p<*p 7<in 


of OO 


THYMI ipoorrrro 


1 7RR 
1 /DO 


1 1 7ft 1 

1 1 /u.. 1 ^yo 


o/ob 


thymi jpooro4ro 


1 7R7 
1 /O/ 




0707 
0/0/ 


thymi jpoorprpo 

1 1 1 1 IVIUHUUU£0£U 


1 7RR 
I / Do 


OQQ CCC 


of 00 


thymi jpoorpri n 


1 7RQ 

i /oy 


HOC HQCO 

i oo.. i yoo 


o/oy 


THYMI JP00RP770 

1 I 1 I \V\\J£.\J\J\JC-f /U 


1 77ft 
I / /U 


oo i .. 1 1 yy 


0/4U 


THYMUP0063650 


1 771 
lift 


PR4 ftftft 


O/ 41 


THYML JP00R4RR0 


1 77P 
I / Id 


OCQ ^01 


Of HC. 


THYMI JPOORRRRO 


1 773 
I / f O 


4ftR 1i4R7 
4UO.. l*fo/ 


0/ 4o 


THYMU20069130 


1774 


341.. 961 


3744 


THYMU20069460 


1775 


231.. 575 


3745 


THYMU20069650 


1776 


728.. 1036 


3746 


THYMU20070250 


1777 


81. .1703 


3747 


THYMU20071120 


1778 


170.. 1792 


3748 


THYMU20071460 


1779 


152.. 499 


3749 


THYMU20072580 


1780 


1981.. 2286 


3750 


THYMU20073070 


1781 


551.. 1225 


3751 
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Table 1 (continued) 



I Clone name 


SEQ ID NO. of nucleotide 
sequence 


Position of CDS 


OLW 1 LJ INU. Ul ctllllllU aolU 

seauence 


THYMU20073080 


1782 


755.. 11 98 


O / O^ 


THYMU20077250 


1783 


645.. 1241 


0 / oo 


THYMU20078020 


1784 


1290 1598 


Of 3H 


THYMU20078240 


1785 


1506 Ifl^fl 


o/oo 


THYMU20079690 


1786 


104Q 1714 

1 Ut9. .lilt 


Of OO 


THYMU20080490 


1787 


S7..398 


Of o/ 


THYMU20081110 


1788 


62Q 1P4R 


o/oo 


THYMU20083390 


1789 


592.. 924 


^75Q 
Of Oa 


THYMU20083500 


1790 


744,, 1067 


^7R0 
Of uu 


THYMU20083830 


1791 


155 17RR 

i ju. ■ i » ou 


Of 0 I 


THYMU20084520 

lilt 1VI vcvvU^wuv 


1792 


PRR 5Q7 

C\j\j . .09/ 


^7RP 

Of U£ 


THYMU20086430 


1793 


7RQ 1077 

/ 0\7. . IU/ / 


O/OO 


THYMU20087270 


1794 


R4P POOR 


Of 0*f 


THYMU20089170 


1795 


one fi-jo 

OU3..U I 9 


Of 03 


THYMU20089900 


1796 


100R 14P1 

1 UUO. . 1 *+c 1 


O / OD 


THYMU20090230 

fill IVIWCvvwvfaWV 


1 797 


10R 5Q4 

1 UO.,J3f 


Q7C7 

Of Of 


THYMU20091040 

1 1 1 1 IVI WCvVxy 1 VTv 


1 798 


7PR 1 0^4 


O/ OO 


THYMU20095920 


1 799 


1 ROQ P°*7R 

1 OU9. .<lO / O 


of oy 


THYMU20096580 


1 Ron 


ipn rqp 


Q77A 

OY /U 


THYMU20097920 

1 1 1 1 IVI ULvvJ / \J C^.\J 


1 801 




Q771 
O/ / 1 


THYMU20098350 


1802 


QQ IRHp 
99. . 1 OUC 


^779 


THYMU200990R0 


1 R0°, 


Rfl 1 ^7** 
OU. . lO/O 


0770 
O/ /O 


THYMU201 00940 

1 1 1 1 1 VI UCU | WUvTV 


1R04 
i out 


R PP1 R 


OY /4 


THYMUP01 044R0 


1 RO^ 

1 DUO 


oy.. i*K>i 


Q77C 
O/ /O 


THYMUP01 OfiQQO 

1111 IVIUL.U i UUuuU 


1 OUO 


O/O., l*fO*f 


077C 

of /o 


THYMU201 10720 


1 R07 


0.7P 1 07fl 


Q777 
Of 1 1 


THYMU201 12570 


1808 


117 4RR 

1 I / ..too 


Q77Q 
0/ /o 


THYMU201125Q0 


1R0Q 


11 P 1 RP7 


Q77Q 

o/ /y 


THYMU201 15380 


1R10 

IOIU 


1^07 1R^Q 


O/ OU 


THYMU201 15730 


1 811 


Q/O 1 coo 


OfOi 


THYMU201 17850 

1 1 1 1 IVIUCv 1 1 r UU v 


1 812 


1RP*> 1Q4R 

1 U£lO. . 1 3HO 


O/ 0<1 


THYMU201 20240 


1R1°, 


iooq qnnp 


O/OO 


THYMU201 20730 

III* IflWbV I C- \J f Ju 


1814 


1P^7 P4f»Q 

1 £lO / . . *L*+09 


O/ 0*f 


THYMU20121040 


1 R1 R 

IOI J 


pqn pqpq 


O/OO 


THYMU201 28910 


1 R1 fi 

IOIU 


1R1 11RR 

i o i . . i i oo 


O/OO 


THYMU201 29020 


1817 




<17R7 
Of 0/ 


THYMU201 30470 

1 1 1 1 IVIVbv 1 wv™/ V 


1818 


R7 11RQ 

o / . . 1 1 U9 


0700 
O/OO 


THYMU201 34260 


1819 


I OHO. . C I / I 


0/ oy 


THYMU201 37050 


1820 


Q58 1341 


0/ au 


THYMU201 37570 


1821 


P11Q P«571 


^7Q1 

0/ y 1 


THYMU20139160 


1822 


RQ RRR 


0/ y^ 


THYMU201 40510 


1823 


1312.. 1626 


3793 


THYMU201 43230 


1824 


575.. 2023 


3794 


THYMU201 45990 


1825 


1330. .2262 


3795 


THYMU201 48010 


1826 


326. .820 


3796 


THYMU201 49230 


1827 


581. .925 


3797 


THYMU20150190 


1828 


1516.. 1908 


3798 


THYMU20151610 


1829 


218. .1663 


3799 


THYMU201 53210 


1830 


1135..2340 


3800 



42 



EP1 308 459 A2 



Table 1 (continued) 



5 



10 



15 



20 



25 



Clone name 


SEQ ID NO. of nucleotide 


Position of CDS 


SEQ ID NO of amino acid 




sequence 




sequence 


THYMU201 54790 


1831 


1220.. 1753 


3801 


THYMU20 157620 


1832 


771.. 1487 


3802 


THYMU201 63600 


1833 


1216.. 1650 


3803 


THYMU201 70080 


1834 


944.. 1399 


3804 


THYMU201 70230 


1835 


835.. 1941 


3805 


THYMU20171580 


1836 


821 1150 

Ufii 1 ■ • 1 1 WW 


3806 


THYMU201 74490 

llll IVI W(.v 1 I ■ 1 1 \J \S 


1837 


52.. 453 


3807 


THYMU20 174790 


1838 


29.. 370 


3808 


THYMU201 75260 


1839 


991 1452 


3809 


THYMU201 76010 

llll IVIWtV 1 / VV I V 


1840 


12.. 2039 


3810 


THYMUP01 77070 

llll IVIWCU 1 / i \J I \J 


1841 


1059 1415 

1 www . . ItU 


3811 


THYMU201 78440 

llll IVIULU 1 f V 


1842 


81. .1517 


3812 


THYMU20181890 


1843 


102 1652 


3813 


THYMU20 184550 

llll IVI \J £—\J 1 U*T^Jv 


1844 


1 71 1 886 


3814 


THYMU201 85470 


1845 


161. .817 


3815 


THYMUP01 85650 


1846 


ww . .sJOU 


3816 


THYMUP01R7P10 

1 1 1 I W\\J C\J ID / C IV 


1847 


190 507 


3817 


THYMUP01 91 970 


1 848 


230 1597 


3818 


TKIHN1 nnonfipo 


1 R49 

1 0*+57 


1075 1419 

1 U / O. . 1 *+ 1 57 


38 1Q 

OO 1 57 


TKinNionni7in 

1 fXlL^IM 1 UUU 1 / lu 


1 R50 


1RR5 P3R5 


3RP0 


TKIDN100019PO 


1851 


98.. 793 


3821 


TRAnH?ooimio 


1R5P 


PPR P033 


3RPP 




1 RR3 


1 P5 47R 


3RP3 


TRAPHPflfM PdQfl 


1 R54 

1 Out 


OO . . 1 ouu 


3RP4 


TRACHP00P1 000 


1 855 


234.. 854 


3825 


TRACHP00P1 3R0 


1 R5R 


313 1R90 

O 1 O. . 1 057U 


3RPR 


TRACHP00P5370 


1857 


Q47 1456 


3827 




1 R5R 
I ooo 


9R4 1 557 


3flPfl i 

OO^O 


TRAPHPnOPQAflf) 


1 R59 


P01 1R07 


3RPQ 


TRAPHpnn4n^Qn 

i n/Awn<nJU*+uov7U 


1 860 


1P5 331 Q 

1 £.\J . . OO 1 57 


3830 


i rirtori^vjvj*+ i uyu 


1 OU 1 


1 P54 1 5R9 


3R31 

OOO I 


TRACHP00433fifi 


1 RRP 

1 OU£ 


30 POfifi 


3R3P 


TRACHP0044990 


1 863 


POQ 790 


3833 


TRACHP0049500 


1 864 


840 116R 


3834 


TRACHP0051 590 


1 865 


927.. 1247 


3835 


TRACHP0057P00 


1866 


P08P 2393 


3836 


TRACHP0058000 


1867 


187.. 1647 


3837 


TRACH20073990 

i i inv/i i£- w / %j<j\s\j 


1868 


98.. 454 


3838 


TRACH20080810 


1869 


2073 >2910 


3839 


TRACHP0081 P70 


1870 


1681 >PG25 


3840 


TRACHP0090060 


1871 


309 1781 


3841 


TRACH20091070 


1872 


129.. 1841 


3842 


TRACH20093400 


1873 


131. .2251 


3843 


TRACH20093480 


1874 


694.. 1092 


3844 


TRACH20098510 


1875 


142.. 1944 


3845 


TRACH20101590 


1876 


1660.. 2220 


3846 


TRACH201 04510 


1877 


144.. 1184 


3847 


TRACH201 08240 


1878 


36..791 


3848 


TRACH201 13020 


1879 


177.. 1352 


3849 
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Table 1 (continued) 



5 



10 



15 



20 



25 



30 



35 



Clone name 


SEQ ID NO of nucleotide 


Position of 

I WOIUWII Wl wUO 


RFO in NO of amino ariH 

O LI W 1 LJ INU. Wl allllMW CtWIU 




sequence 




sequence 


TRACH201 22980 


1880 


73P 156P 


3850 


TRACH201 23870 


1881 


P35 S40 


3851 


TRACH201 24970 


1882 


109 486 
1 w^, , tow 


3852 


TRACH201 25620 


1883 


22.. 387 


3853 


TRACH20129180 


1884 


1 5.45 1 RQR 


0004 


TRACH20131230 


1885 


1?Q PQQQ 


0000 


TRACH201 39280 


1 886 


1 CO H 
1 Jt„ 1 *+/ H 


^RRR 
0000 


TRACH20140180 


1887 


prn 553 


^RR7 
000/ 


TRACH2 01 43710 


1 888 


1P1Q 1QQR 


*3RCR 
OOOO 


TRACH201 49500 


1889 


IRQ Q06 

1 WW. . \7WW 


^RCQ 

oooy 


TRACH201 49720 


1 RQ0 


1R7 1RQC. 


OODw 


TRACH201 49740 


1 8Q1 


RR4 1 COR 
004., 1 JUJ 


OOO I 


TRACH201 58240 


1 RQP 


P54 7QR 


^n«o 


TRACH201 59390 


1 RQ1 


1R1R 1Q41 


0000 


TRACH2016 


1894 


1 PP°, 1 R31 


OO OH 


TRACH201 63470 


1 RQ5. 


Q71 ^^Cfi 


0000 


TRACH20164100 


1 RQR 


070 700 
Old...! 


OOOO 


TRAnHPOl 64R1 0 


1 RQ7 
1 057/ 


icoc oioa 


QQC7 
OOO/ 


TRACHP01 RW^O. 


1 RQR 


/w*f.. 1 1 


OOOO 


TRACHP01 fi5S4n 


1 RQQ 

1 OSS 


ol w.. 1 Ol *r 


oooy 


TRACHP01 R70Q0 
i nnuntu i u / W\7W 




R7 HZtt 


OQ7A 

oo/U 


TRACHP01 70flfi0 


1 Q01 

I i7W I 


1 RR 1 ROC 




TRAHHP01 7^RRfl 


1 QO.P 


QQ i CQ7 


OO Id. 


TRAHHP01 74QR0 


i yuo 


yo. . i uoo 


OQ7Q 
OO/O 


TRAHHP01 RP7R0 


1 Q04 


1 RAO 0070 
1 OD9, ,c\j{ U 


00/4 


TRACHP01851P 


1 QOR 


1 ^ *^O.Q 


QQ7C 
00 /0 


TRACHP01 88350 


1 906 


P4R mi*? 


007c 
00/D 


TRAPHP01 Qfl4fifl 


1 Q07 


QCH H COR 


0077 
00/ / 


umvfni nnm iRn 

vjivi v izi n i uuu i ooy 


1 QOR 


OiOO O/IQR 


QQ7Q 
00/0 


UTFRU1 0001 ORO 

w 1 LMU I www i www 


1 Q0.Q 


i B7 CCO 
10/.. DO*i 


00/ y 


i jtfri )1 0001 R7n 

\j 1 cnu i www io/u 


1 qi o 
1 y 1 u 


1 DO 1 ..dUol 


000U 


UTFRUPOOOOP^O 


1 Q11 


1 Pfl ROJl 
I ^U..DUO 


OOO 1 


UTFRUPOOOOQ^iO 


1 Q1P 

19 1c 


P40 1 RCQ 
tHU.. 1 OOO 




UTFRUP0011760 


1 & i 0 


1 71 Q POOH 


OOOO 


UTFRUP001 ^RQO 


1 Q14 


1 o^m-.. 1 yoi 


QQQi 

00 o4 


UTERUP001 6580 


1 Q1S 


PC 1 PR7 


QQQC 
OOOO 


UTFRUP00P66P0 


1 Q16 

13 ID 


mpn irc;r 

I W^W. . I OOO 


OOOO 


UTERU20027360 


1 Q17 


1 RP4 1 QQP 


OOO/ 


UTERU20029930 


1918 


RP 1Q0P 

DC.. 1 %7W£. 


QOQQ 

OOOO 


UTFRUP0031350 
w I Lnutwo i wow 




^C C90 ! 
OO. .Oil w 


QOQQ 

oooy 


UTFRUP0035770 

v I L_l 1VL.UUOJI / W 


1 Qpn 


Qpn 1 o«o 


QQQO 

ooyu 


UTERU20040150 


1921 


917.. 1717 


3891 


UTERU20040370 


1922 


22.. 1866 


3892 


UTERU20040390 


1923 


727.. 1200 


3893 


UTERU20040730 


1924 


1009..>2303 


3894 


UTERU20041630 


1925 


105.. 1328 


3895 


UTERU20041970 


1926 


26..484 


3896 


UTERU20045200 


1927 


6..917 


3897 


UTERU20051790 


1928 


41 5.. 1341 


3898 
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Table 1 (continued) 





pinno n a mo 
OIUIIc MalllU 


OCA I Pi Mfi Af ni lAlootiHo 

ocw il/ iNL/. or nucieuiiut; 
seauenee 


Pnclfinn of PnQ 


obvj iu inu. or amino acid 

con 1 1 0 n ao 
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[0028] Namely, primers used to synthesize polynucleotides can be designed based on the nucleotide sequences of 
polynucleotides of the present invention shown in SEQ ID NOs in the above Table 1 . When one intends to synthesize 
full-length cDNAs, an oligo dT primer can be used as the 3'-end primer. The length of the primers is usually 15-100 
bp, and favorably between 15-35 bp. In case of LA PCR, which is described below, the primer length of 25-35 bp may 
provide a good result. 

[0029] A method to design a primer that enables a specific amplification based on the aimed nucleotide sequence 
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is known to those skilled in the art (Current Protocols in Molecular Biology, Ausubei et al. edit, (1987) John Wiley & 
Sons, Section 6.1-6.4). In designing a primer based on the 5'-end sequence, the primer is designed so as that, in 
principle, the amplification products will include the translation start site. Accordingly, for example, when the 5'-end 
primer is designed based on the nucleotide sequence of 5' untranslated region (5'UTR), any part of the 5'-end, which 

5 ensures the specificity to the cDNA of interest, can be selected as the primer 

[0030] When synthesizing a full-length cDNA, the target nucleotide sequence to be amplified can extend to several 
thousand bp in some cDNA. However, it is possible to amplify such a long nucleotides by using such as LA PCR (Long 
and Accurate PCR). It is advantageous to use LA PCR when synthesizing long DNA. In LA PCR, in which a special 
DN A polymerase having 3' -> 5* exonuclease activity is used, misincorporated nucleotides can be removed. Accordingly, 

10 accurate synthesis of the complementary strand can be achieved even with a long nucleotide sequence. By using LA 
PCR, it is reported that amplification of a nucleotide with 20 kb longer can be achieved under desirable conditions 
(Takeshi Hayashi (1996) Jikken-lgaku Bessatsu, "Advanced Technologies in PCR" Youdo-sha). 
[0031] A template DNA for synthesizing the full-length cDNA of the present invention can be obtained by using cDNA 
libraries that are prepared by various methods. The full-length cDNA clones of the present invention are clones with 

15 high probability of completeness in length, which were obtained by the method comprising the steps of [1] preparing 
libraries containing cDNAs with the very high fullness ratio by oligo-capping, and [2] assembling the 5'-end sequences 
and selecting one with the highest probability of completeness in length in the cluster formed (there are many clones 
longer in the 5'-end direction). 

[0032] However, the uses of primers designed based on the full-length nucleotide sequences provided by the present 

20 invention enable easily obtaining full-length cDNAs without such a special technique. 

[0033] The problem with the cDNA libraries prepared by the known methods or commercially available is that mRNA 
contained in the libraries has very low fullness ratio. Thus, it is difficult to screen full-length cDNA clone directly from 
the library using ordinary cloning methods. The present invention has revealed a nucleotide sequence of novel full- 
length cDNA. If a full-length nucleotide sequence is provided, it is possible to synthesize a target full-length cDNA by 

25 using enzymatic reactions such as PCR. In particular, a full-length-enriched cDNA library, synthesized by methods 
such as oligo-capping, is desirable to synthesize a full-length cDNA with more reliability. 

[0034] The 5*-end sequence of the full-length cDNA clones of the invention can be used to isolate the regulatory 
element of transcription including the promoter on the genome. A rough draft of the human genome (analysis of human 
genomic sequence with lower accuracy), which covers 90% of the genome, has been reported (Nature, Vol.409, 

30 81 4-823, 2001 ), and by the year 2003, analysis of the entire human genomic sequence is going to be finished. However, 
it is hard to analyze with software the transcription start sites on the human genome, in which long introns exist. By 
contrast, it is easy to specify the transcription start site on the genomic sequence using the nucleotide sequence which 
includes the 5'-end of the full-length cDNA clone of the present invention, and thus it is easy to obtain the genomic 
region involved in transcription regulation, which includes the promoter that is contained in the upstream of the tran- 

35 scription start site. 

[0035] The polypeptide encoded by the full-length cDNA of the invention can be prepared as a recombinant polypep- 
tide or as a natural polypeptide. For example, the recombinant polypeptide can be prepared by inserting the polynu- 
cleotide encoding the polypeptide of the invention into a vector, introducing the vector into an appropriate host cell and 
purifying the polypeptide expressed within the transformed host cell, as described below. In contrast, the natural 

40 polypeptide can be prepared, for example, by utilizing an affinity column to which an antibody against the polypeptide 
of the invention (Current Protocols in Molecular Biology (1987) Ausubei et al. edit, John Wiley & Sons, Section 
1 6. 1 -1 6. 1 9) is attached. The antibody used for affinity purification may be either a polyclonal antibody, or a monoclonal 
antibody. Alternatively, in vitro translation (See, for example, "On the fidelity of mRNA translation in the nuclease-treated 
rabbit reticulocyte lysate system." Dasso M.C., and Jackson R.J. (1989) Nucleic Acids Res. 17: 3129-3144) may be 

45 used for preparing the polypeptide of the invention. 

[0036] Polypeptides functionally equivalent to the polypeptides of the present invention can be prepared based on 
the activities, which were clarified in the above-mentioned manner, of the polypeptides of the present invention. Using 
the biological activity possessed by the polypeptide of the invention as an index, it is possible to verify whether or not 
a particular polypeptide is functionally equivalent to the polypeptide of the invention by examining whether or not the 

50 polypeptide has said activity. 

[0037] Polypeptides functionally equivalent to the polypeptides of the present invention can be prepared by those 
skilled in the art, for example, by using a method for introducing mutations into an amino acid sequence of a polypeptide 
(for example, site-directed mutagenesis (Current Protocols in Molecular Biology, edit, Ausubei et al., (1 987) John Wiley 
& Sons, Section 8.1-8.5). Besides, such polypeptides can be generated by spontaneous mutations. The present in- 

55 vention also includes a polypeptide comprising the amino acid sequence shown in Table 1 in which one or more amino 
acids are substituted, deleted, inserted, and/or added, as long as the polypeptides have the equivalent functions to 
those of the polypeptides identified in the present Examples described later. 

[0038] There are no limitations on the number and sites of amino acid mutations, as long as the polypeptides maintain 
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the functions thereof. The number of mutations typically corresponds to 30% or less, or 20% or less, or 10% or less, 
preferably 5% or less, or 3% or less of the total amino acids, more preferably 2% or less or 1% or less of the total 
amino acids. Alternatively, herein, substitution of one or more amino acids includes substitution of several amino acids. 
As used herein, the term "several amino acids" means, for example, 5 amino acids, preferably 4 or 3 amino acids, 

s more preferably 2 amino acids, and further preferably 1 amino acid. 

[0039] From the viewpoint of maintaining the polypeptide function, it is preferable that a substituted amino acid has 
a similar property to that of the original amino acid. For example, Ala, Val, Leu, lie, Pro, Met, Phe and Trp are assumed 
to have similar properties to one another because they are all classified into a group of non-polar amino acids. Similarly, 
substitution can be performed among non-charged amino acid such as Gly, Ser, Thr, Cys, Tyr, Asn, and Gin, acidic 

10 amino acids such as Asp and Glu, and basic amino acids such as Lys, Arg, and His. 

[0040] In addition, polypeptides functionally equivalent to the polypeptides of the present invention can be isolated 
by using techniques of hybridization or gene amplification known to those skilled in the art. Specifically, using the 
hybridization technique (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, Section 
6.3-6.4)), those skilled in the art can usually isolate a polynucleotide highly homologous to the polynucleotide encoding 

15 the polypeptide identified in the present Example based on the identified nucleotide sequence (Table 1) or a portion 
thereof and obtain the functionally equivalent polypeptide from the isolated polynucleotide. The present invention in- 
clude polypeptides encoded by the polynucleotides hybridizing with the polynucleotides encoding the polypeptides 
identified in the present Example, as long as the polypeptides are functionally equivalentto the polypeptides identified 
in the present Example. Organisms from which the functionally equivalent polypeptides are isolated are illustrated by 

20 vertebrates such as human, mouse, rat, rabbit, pig and bovine, but are not limited to these animals. 

[0041] Washing conditions of hybridization for the isolation of polynucleotides encoding the functionally equivalent 
polypeptides are usually "1x SSC, 0.1% SDS, 37°C"; more stringent conditions are "0.5x SSC, 0.1% SDS, 42°C"; and 
still more stringent conditions are "0.1 x SSC, 0.1% SDS, 65°C". Alternatively, the following conditions can be given as 
hybridization conditions of the present invention. Namely, conditions in which the hybridization is done at "6x SSC, 

25 40% Formamide, 25°C", and the washing at "1x SSC, 55°C" can be given. More preferable conditions are those in 
which the hybridization is done at "6x SSC, 40% Formamide, 37°C", and the washing at "0.2x SSC, 55°C W . Even more 
preferable are those in which the hybridization is done at "6x SSC, 50% Formamide, 37°C'\ and the washing at "0.1 x 
SSC, 62°C". The more stringent the conditions of hybridization are, the more frequently the polynucleotides highly 
homologous to the probe sequence are isolated. Therefore, it is preferable to conduct hybridization under stringent 

30 conditions. Examples of stringent conditions in the present invention are, washing conditions of "0.5x SSC, 0.1 % SDS, 
42°C", or alternatively, hybridization conditions of "6x SSC, 40% Formamide, 37°C H , and the washing at "0.2x SSC, 
55°C". 

[0042] One skilled in the art can suitably select various conditions, such as dilution ratios of SSC, formamide con- 
centrations, and temperatures to accomplish a similar stringency. 
35 [0043] However, the above-mentioned combinations of SSC, SDS and temperature conditions are indicated just as 
examples. Those skilled in the art can select the hybridization conditions with similar stringency to those mentioned 
above by properly combining the above-mentioned or other factors (for example, probe concentration, probe length 
and duration of hybridization reaction) that determines the stringency of hybridization. 

[0044] The amino acid sequences of polypeptides isolated by using the hybridization techniques usually have high 

40 identity to those of the polypeptides of the present invention, which are shown in Table 1 . The present invention en- 
compasses a polynucleotide comprising a nucleotide sequence that has a high identity to the nucleotide sequence of 
claim 1 (a). Furthermore, the present invention encompasses a peptide, or polypeptide comprising an amino acid 
sequence that has a high identity to the amino acid sequence encoded by the polynucleotide of claim 1 (b). The term 
"high identity" indicates sequence identity of at least 40% or more; preferably 60% or more; and more preferably 70% 

4 5 or more. Alternatively, more preferable is identity of 90% or more, or 93% or more, or 95% or more, furthermore, 97% 
or more, or 99% or more. The identity can be determined by using the BLAST search algorithm. 
[0045] As used herein, "percent identity" of amino acid sequences or nucleic acids is determined using the algorithm 
BLAST of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 1 993). Such an algorithm is incorporated into 
the BLASTN and BLASTX programs of Altschul et al. (J. Mol. Biol.21 5:403-410, 1990). BLAST nucleotide searches 

50 are performed with the BLASTN program, for example, score = 100, wordlength = 12. BLAST protein searches are 
performed with the BLASTX program, for example, score = 50, wordlength = 3. When utilizing BLAST and Gapped 
BLAST programs, the default parameters of the respective programs are used. See http://www.ncbi.nlm.nih.gov. 
[0046] With the gene amplification technique (PCR) (Current Protocols in Molecular Biology, edit, Ausubel et al., 
(1987) John Wiley & Sons, Section 6.1-6.4)) using primers designed based on the nucleotide sequence (Table 1) or a 

55 portion thereof identified in the present Example, it is possible to isolate a polynucleotide fragment highly homologous 
to the polynucleotide sequence or a portion thereof and to obtain functionally equivalent polypeptide to a particular 
polypeptide identified in the present Example based on the isolated polynucleotide fragment. 
[0047] The present invention also provides a polynucleotide containing at least 15 nucleotides complementary to a 



47 



EP 1 308 459 A2 



polynucleotide comprising a nucleotide sequence of SEQ ID NOs shown in Table 1 or the complementary strand thereof. 
Herein, the term "complementary strand" is defined as one strand of a double strand DNA composed of A:T and G:C 
base pair to the other strand. Also, "complementary" is defined as not only those completely matching within a contin- 
uous region of at least 1 5 nucleotides, but also having a identity of at least 70%, favorably 80% or higher, more favorably 
5 90% or higher, and most favorably 95% or higher within that region. The identity may be determined using the algorithm 
described herein. 

[0048] Such a polynucleotide includes probes and primers used forthe detection and amplification of a polynucleotide 
encoding the inventive polypeptide. When used as a primer, the polynucleotide usually comprises 15 to 100 bp, and 
preferably of 15 to 35 bp. When used as a probe, the polynucleotide comprises the whole or a part of the sequence 

10 of a polynucleotide of the invention, and comprises at least 15 bp. When used as primers, such polynucleotides are 
complementary at the 3'-end, and restriction enzyme recognition sequences or tags can be added to the 5'-end. 
[0049] Furthermore, polynucleotides of the present invention include an antisense polynucleotide for suppressing 
the expression of a polypeptide of the invention, which comprises an amino acid sequence of SEQ ID NOs shown In 
Table 1 . To exert an antisense effect, an antisense polynucleotide has at least 15 bp or more, for example 50 bp or 

15 more, preferably 1 00 bp or more, and more preferably 500 bp or more, and usually has 3000 bp or less, and preferably 
2000 bp or less. Antisense polynucleotides can be used in the gene therapy of diseases caused by abnormalities of 
the polypeptides of the invention (abnormal function or abnormal expression). An antisense polynucleotide can be 
prepared, for example, by the phosphorothioate method ("Physicochemical properties of phosphorothioate oligodeox- 
ynucleotides." Stein (1988) Nucleic Acids Res. 16: 3209-3221) based on the sequence information of polynucleotide 

20 encoding a polypeptide of the invention (for example, the nucleotide sequences of SEQ ID NO: 1 to 1 970). 

[0050] The polynucleotides or antisense polynucleotides of the present invention can be used in, for example, gene 
therapy. As target diseases, for example, cancers or various inflammatory diseases may be preferable. These mole- 
cules can be used for gene therapy, for example, by administrating them to patients by the in vivo or ex vivo method 
using virus vectors such as retrovirus vectors, adenovirus vectors, and adeno-related virus vectors, or non-virus vectors 

25 such as liposomes. 

[0051] The present invention also includes a partial peptide of the polypeptides of the invention. The partial peptide 
comprises a polypeptide generated as a result that a signal peptide has been removed from a secretory protein. If the 
polypeptide of the present invention has an activity as a receptor or a ligand, the partial peptide may function as a 
competitive inhibitor of the polypeptide and may bind to the receptor (or ligand). In addition, the present invention 

30 includes an antigen peptide for raising antibodies. For the peptides to be specific for the polypeptide of the invention, 
the peptides comprise at least 7 amino acids, preferably 8 amino acids or more, more preferably 9 amino acids or 
more, and even more preferably 10 amino acids or more. The peptide can be used for preparing antibodies against 
the polypeptide of the invention, or competitive inhibitors of them, and also screening for a receptor that binds to the 
polypeptide of the invention. The partial peptides of the invention can be produced, for example, by genetic engineering 

35 methods, known methods for synthesizing peptides, or digesting the polypeptide of the invention with an appropriate 
peptidase. 

[0052] The present invention also relates to a vector into which a polynucleotide of the invention is inserted. The 
vector of the invention is not limited as long as it contains the inserted polynucleotide stably. For example, if E. coli is 
used as a host, vectors such as pBluescript vector (Stratagene) are preferable as a cloning vector. To produce the 

40 polypeptide of the invention, expression vectors are especially useful. Any expression vector can be used as long as 
it is capable of expressing the polypeptide in vitro, in E. coli, in cultured cells, or in vivo. For example, pBEST vector 
(Promega) is preferable for in vitro expression, pET vector (Invitrogen) for E. coli, pME1 8S-FL3 vector (GenBank Ac- 
cession No. AB009864) for cultured cells, and pME18S vector (Mol. Cell. Biol. (1 988) 8: 466-472) for in vivo expression. 
To insert the polynucleotide of the invention, ligation utilizing restriction sites can be performed according to the standard 

45 method (Current Protocols in Molecular Biology (1987) Ausubel etal. edit, John Wiley & Sons, Section 11.4-11.11). 
[0053] Recently, the technique of GATEWAY™ system (Invitrogen), which is an expression vector construction sys- 
tem for polypeptide expression, has been developed (Experimental Medicine, Vol. 1 8, No. 1 9 (December), p271 6-271 7, 
2000). This system includes two types of site-specific recombinases (BP CLONASE™ and LR CLONASE™) derived 
from lambda phage and uses BP CLONASE™ -specific recombination sites for an Entry Vector and LR CLONASE™- 

50 specific recombination sites for a Destination Vector, which may comprise atag useful for polypeptide purification. With 
this system, an expression vector can be obtained by using homologous recombination. 

[0054] First, a polynucleotide fragment of interest is inserted into the entry vector using the first recombination. Then, 
the secondary recombination is allowed to take place between the entry vector, where the polynucleotide fragment of 
interest has been inserted, and the destination vector. Thus, the expression vector can be prepared rapidly and highly 
55 efficiently. With the above-mentioned typical method using restriction enzyme and ligase reactions, the step of expres- 
sion vector construction and expression of polypeptide of interest takes about 7 to 10 days. However, with the GATE- 
WAY™ system, the polypeptide of interest can be expressed and prepared in only 3 to 4 days. Thus, the system 
ensures a high-throughput functional analysis for expressed polypeptides (http://biotech.nikkeibp.co.jp/netlink/lto/gate- 
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way/). 

[0055] The present invention also relates to a transf ormant carrying the vector of the invention. Any cell can be used 
as a host into which the vector of the invention is inserted, and various kinds of host cells can be used depending on 
the purposes. For strong expression of the polypeptide in eukaryotic cells, COS cells or CHO cells can be used, for 
5 example. 

[0056] Introduction of the vector into host cells can be performed, for example, by calcium phosphate precipitation 
method, el ectropo ration method (Current Protocols in Molecular Biology (1 987) Ausubel et al. edit, John Wiley & Sons, 
Section 9.1-9.9), lipofectamine method (GIBCO-BRL), or microinjection method, etc. 

[0057] Further, a polynucleotide containing at least 15 nucleotides comprising a nucleotide sequence of any one of 

10 the polynucleotides comprising the nucleotide sequences of SEQ ID NOs shown in Table 1 or the complementary 
strand thereof can be used not only as a primer for synthesizing full-length cDNAs but also for testing and diagnosing 
the abnormalities of the polypeptide encoded by the full-length cDNA of the present invention. For example, by utilizing 
polymerase chain reaction (genomic DNA-PCR, or RT-PCR) using the polynucleotide of the invention as a primer, 
polynucleotide encoding the polypeptide of the invention can be amplified. It is also possible to obtain the regulatory 

is region of expression in the 5'-upstream by using PCRor hybridization sincethetranscriptionstartsitewithin the genomic 
sequence can be easily specified based on the 5'-end sequence of the full-length cDNA. The obtained genomic region 
can be used for detection and/or diagnosis of the abnormality of the sequence by RFLP analysis, SSCP, or sequencing. 
Especially, in the case where expression of the mRNA of the present invention varies according to a specific disease, 
analysis of the amount of expression of the mRNA using the polynucleotide of the present invention as a probe or a 

20 primer enables detection and diagnosis of the disease. 

[0058] The present invention also relates to antibodies that bind to the polypeptide of the invention. There are no 
limitations in the form of the antibodies of the invention. They include polyclonal antibodies, monoclonal antibodies, or 
their portions that can bind to an antigen. They also include antibodies of all classes. Furthermore, special antibodies 
such as humanized antibodies and chimeric antibodies are also included. 

25 [0059] The polyclonal antibody of the invention can be obtained according to the standard method by synthesizing 
an oligopeptide corresponding to the amino acid sequence and immunizing rabbits with the peptide (Current Protocols 
in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11 .12-11 .13). The monoclonal antibody of 
the invention can be obtained according to the standard method by purifying the polypeptide expressed in E. coli, 
immunizing mice with the polypeptide, and producing a hybridoma cell by fusing the spleen cells and myeloma cells 

30 (Current Protocols in Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11 .4-11 .11). 

[0060] The antibody binding to the polypeptide of the present invention can be used for purification of the polypeptide 
of the invention, and also for detection and/or diagnosis of the abnormalities of the expression and structure of the 
polypeptide. Specifically, polypeptides can be extracted, for example, from tissues, blood, or cells, and the polypeptide 
of the invention is detected by Western blotting, immunoprecipitation, or ELISA, etc. for the above purpose. 

35 [0061] Furthermore, the antibody binding to the polypeptide of the present invention can be utilized for treating the 
diseases that associates with the polypeptide of the invention. If the antibodies are used for treating patients, human 
antibodies, humanized antibodies, or chimeric antibodies are preferable in terms of their low antigenicity. The human 
antibodies can be prepared by immunizing a mouse whose immune system is replaced with that of human (e.g., see 
"Functional transplant of megabase human immunoglobulin loci recapitulates human antibody response in mice" Men- 

40 dez, M.J. et al. (1997) Nat. Genet. 15: 146-156). The humanized antibodies can be prepared by recombination of the 
hypervariable region of a monoclonal antibody (Methods in Enzymology (1991) 203: 99-121). 
[0062] A cDNA of the present invention encodes, for example, an amino acid sequence of a protein that is predicted 
to have the following function. The use of the amino acid sequences of the polypeptides encoded by the cDNAs of the 
present invention enables predicting that the polypeptides have the following functions. It can be predict, from the 

45 results of homology search of SwissProt, GenBank, UniGene, or nr, that these polypeptides have such functions. 
Specifically, for instance, as shown in Examples, searching for a known gene or polypeptide that is homologous to the 
partial sequence of the full-length cDNA of the invention (1 970 clone) and referring the function of the gene and of the 
polypeptide encoded by the gene make it possible to predict the function of the polypeptide encoded by the cDNA of 
the invention. In this way, each of 1 078 clones out of the 1 970 full-length cDNA clones of the invention was predicted 

50 to encode a polypeptide that was classified into the following categories. 



Secretory and/or membrane protein (516 clones) 

Glycoprotein-related protein (121 clones) 

Signal transduction-related protein (88 clones) 

Transcription- related protein (143 clones) 

Disease-related protein (331 clones) 

Enzyme and/or metabolism-related protein (219 clones) 

Cell division- and/or cell proliferation-related protein (44 clones) 
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Cytoskeleton-related protein (80 clones) 
Nuclear protein and/or RNA synthesis- related protein (70 clones) 
Protein synthesis- and/or transport-related protein (20 clones) 
Cellular defense-related protein (10 clones) 
5 Development and/or differentiation-related protein (1 9 clones) 

DNA- and/or RNA-binding protein (168 clones) 
ATP- and/or GTP-binding protein (93 clones) 

[0063] The functions of the polypeptides encoded by the cDNAs of the present invention can be predicted by as- 

10 sessing the presence of signal sequence, transmembrane region, nuclear translocation signal, glycosylation signal, 
phosphorylation site, and zinc finger motif, SH3 domain, etc. in the amino acid sequences. The programs, PSORT 
(Nakai K., and Kanehisa M. (1992) Genomics 14; 897-911), SOSUI (Hirokawa T et al. (1998) Bioinformatics 14: 
378-379) (Mitsui Knowledge Industry), and MEMSAT (Jones D.T., Taylor W.R., and Thornton J.M. (1994) Biochemistry 
33: 3038-3049) can be used to predict the existence of the signal sequence or transmembrane region. Alternatively, 

15 a partial amino acid sequence of the polypeptide is fused with another polypeptide such as GFP, the fusion polypeptide 
is transfected into cultured cells, and the localization is analyzed to predict the function of the original polypeptide. 
[0064] Based on the determined nucleotide sequences of the full-length cDNAs obtained in the present invention, it 
is possible to predict more detailed functions of the polypeptides encoded by the cDNA clones, for example, by search- 
ing the databases such as GenBank, Swiss-Prot, UniGene, and nr for homologies of the cDNAs; or by searching the 

20 amino acid sequences deduced from the full-length cDNAs for signal sequences by using software programs such as 
PSORT, for transmembrane regions by using software programs such as SOSUI or for motifs by using software pro- 
grams such as Pfam (http://www.sanger.ac.uk/Software/Pfam/index.shtml) and PROSITE (http://www.expasy.ch/ 
prosite/). As a matter of course, the functions are often predictable by using partial sequence information (preferably 
300 nucleotides or more) instead of the full-length nucleotide sequences. However, the result of the prediction by using 

25 partial nucleotide sequence does not always agree with the result obtained by using full-length nucleotide sequence, 
and thus, it is needless to say that the prediction of function is preferably performed based on the full-length nucleotide 
sequences. 

[0065] GenBank, Swiss-Prot, UniGene and nr databases were searched for homologies of the full-length nucleotide 
sequences of the 1970 clones (see Example 6). The amino acid sequences deduced from the full-length nucleotide 
30 sequences were searched for functional domains by PSORT, SOSUI and Ram. Prediction of functions of polypeptides 
encoded by the clones and the categorization thereof were performed based on these results obtained. The catego- 
rization was carried out by the following method. 

[1] Firstly, the cDN A clones were classified into the above-mentioned Afunctional categories based on the results 
35 of annotation-based categorization (using the keywords in the case of Swiss-Prot hit data; using Definition or 

Reference information in the case of GenBank, UniGene, or nr hit data), and the signal sequence search of the 
deduced ORFs by PSORT and the transmembrane region search by SOSUI. 

[2] Secondly, clones which had been unassignable to the categories by the method of [1 ] were searched for func- 
tional domains and/or motifs by Pfam. Based on the results, the clones were additionally classified into the above- 
40 mentioned 14 types of categories when they had a functional domain and/or motif assignable to any one of the 

categories. 

[0066] The following 51 6 clones presumably belong to secretory and/or membrane proteins. 

ADRGL20020290, ADRGL20021910, ADRGL20036380, ADRGL20036840, ADRGL2005961 0, ADRGL20063770, 

45 ADRGL20066770, ASTR020010010, ASTRO20020240, ASTRO20045840, ASTRO20053430, ASTRO20055530, 
ASTRO20055570, ASTRO20055930, ASTRO2008B950, ASTRO20091180, BNGH420021 680, BNGH420023870, 
BNGH420046790, BNGH420052350, BNGH420059680, BNGH420075940, BNGH420077980, BRACE1 0000510, 
BRACE20051930, BRACE20052530, BRACE20054080, BRACE20066360, BRACE20068710, BRACE200690 
BRACE20069110, BRACE 20 194670, BRACE20204670, BRACE20216950, BRAMY1 0001 730, BRAMY20003880, 

50 BRAMY20013670, BRAMY20024790, BRAMY20027390, BRAMY20028530, BRAMY20035380, BRAMY20044920, 
BRAMY20045210, BRAMY20047560, BRAMY20050940, BRAMY20053910, BRAMY20055760, BRAMY20072440, 
BRAMY20083820, BRAMY20089770, BRAMY20091230, BRAMY20094890, BRAMY20096930, BRAMY2011841 0, 
BRAMY201 23400, B RAM Y201 25550, BRAMY20 127310, B RAM Y201 27760, BRAMY20 135720, BRAMY201 37360, 
BRAMY201 39440, BRAMY20 152510, B RAM Y201 94680, BRAMY20204270, BRAMY20225320, BRAMY20237190, 

55 BRAMY20245140, BRAMY20251750, BRAMY20285650, BRAWH20020470, BRAWH20021 91 0, BRAWH20026010, 
BRAWH20030000, BRAWH20039640, BRAWH 20055330, BRAWH20078620, BRAWH20093070, BRAWH201 85270, 
BRCAN 10000760, BRCAN1 0001 680, BRCAN20001480, BRCAN20004180, BRCAN20005230, BRCOC20000470, 
BRCOC20003600, BRHIP1 0000720, BRHIP1 0001040, BRHIP20000210, BRSSN20001 970, BRSSN20074640, 
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BRSSN20091190, CD34C20001 750, CTONG200 17490, CTONG20036990, CTONG20041260, CTONG20044870, 
CTONG20045500, CTONG20049480, CTONG20051450, CTONG20055850, CTONG20056150, CTONG20059130, 
CTONG20060040, CTONG20063770, CTONG20065680, CTONG20068360, CTONG20069320, CTONG20071 680, 
CTONG20076810, CTONG20078340, CTONG20079590, CTONG20083980, CTONG20084020, CTONG20085210, 

5 CTONG201 67750, CTONG201 68240, CTONG20 179890, CTONG20 183830, CTONG20 184830, DFNES2001 8000, 
DFNES20029660, DFNES20057660, DFNES20072990, DFNES20080880, FCBBF20018680, FCBBF20029280, 
FCBBF20032930, FCBBF20036360, FCBBF20054390, FCBBF30004340, FCBBF30022680, FCBBF30029250, 
FCBBF30042610, FCBBF30062490, FCBBF30075970, FCBBF30078600, FCBBF30091520, FCBBF30095410, 
FCBBF301 05440, FCBBF301 18670, FCBBF301 32660, FCBBF301 35890, FCBBF301 45670, FCBBF301 64510, 

10 FCBBF301 69870, FCBBF30171230, FCBBF30 172330, FCBBF30 177290, FCBBF301 79740, FCBBF301 95690, 
FCBBF301 97840, FCBBF30212210, FCBBF30223110, FCBBF30223210, FCBBF30225930, FCBBF30230610, 
FCBBF30260480, FCBBF30266510, FCBBF30287940, FCBBF50000610, FCBBF50004950, FEBRA20007820, 
FEBRA20018670, FEBRA20031280, FEBRA20031 81 0, FEBRA20038220, FEBRA20039260, FEBRA20040230, 
FEBRA20040560, FEBRA20046280, FEBRA20080860, FEBRA20084750, FEBRA20088810, FEBRA201 15930, 

15 FEBRA201 16650, FEBRA20121950, FEBRA20141 980, FEBRA201 77800, FEBRA20 182030, FEBRA20191720, 
HCHON1 0001 660, HCHON20015050, HEARX1 0001 490, HEART20031680, HHDPC10001140, HHDPC20051850, 
HHDPC20082790, HHDPC20088160, HLUNG20015070, HLUNG20015180, HLUNG20020850, HLUNG20029490, 
HLUNG20032460, HLUNG20033350, HLUNG20034970, HLUNG20037160, HLUNG20041540, HLUNG20042730, 
HLUNG20050760, HLUNG20052300, HLUNG20060670, HLUNG20065990, HLUNG20074330, HLUNG20081390, 

20 HLUNG20088750, HLUNG20092530, KIDNE20016360, KIDNE20083150, KIDNE20084030, KIDNE20084040, 
KIDNE20084800, KIDNE20086490, KIDNE20086660, KIDNE20094670, KIDNE20134130, KIDNE201 42900, 
KIDNE201 43200, KIDNE201 48080, KIDNE201 60960, KIDNE201 63710, KIDNE20169180, KIDNE201 82540, 
KIDN E201 861 70, KIDNE201 88630, KIDNE201 89960, LIVER20007750, LIVER2001 051 0, LIVER2001 0990, 
LIVER20026440, LIVER20030650, LIVER20038000, MESAN20007110, MESAN20008150, MESAN20021220, 

25 MESAN20027900, MESAN20058110, MESAN20059570, MESAN20060430, MESAN20067430, MESAN200B4150, 
MESAN20095220, NT2NE200 18740, NT2NE20021 860, NT2NE20039210, NT2NE20053230, NT2NE2005921O, 
NT2NE20064780, NT2NE20069580, NT2NE20080770, NT2NE20082130, NT2NE20092950, NT2NE20140130, 
NT2NE201 45250, NT2NE201 46510, NT2NE20 152620, NT2NE20 167660, NT2NE20181800, NT2R 12001 6240, 
NT2RI20021200, NT2RI20033920, NT2RP70003110, NT2RP70027790, NT2RP70031070, NT2RP70031480, 

30 NT2RP70056690, NT2RP70087140, NTONG20034540, NTONG 20053630, OCBBF20000740, OCBBF2001 2520, 
OCBBF201 09780, OCBBF201 10210, OCBBF201 10730, OCBBF201 12280, OCBBF201 18720, OCBBF201 20010, 
OCBBF201 23200, OCBBF201 55030, OCBBF20 165900, OCBBF201 65910, OCBBF20 170350, OCBBF201 76650, 
OCBBF201 85630, OCBBF20191950, PANCR1 0000860, PEBLM20001800, PLACE50001290, PLACE60004260, 
PLACE60006300, PLACE60053280, PLACE 60055590, PLACE60056910, PLACE60057860, PLACE60061370, 

35 PLACE60064740, PLACE60070500, PLACE60087680, PLACE 60 104630, PLACE601 07010, PLACE601 1 3340, 
PLACE601 38840, P LAC E60 154450, PLACE60 184870, PROST1 0001 100, PROST200111 60, PROST2001 4150, 
PROST20035830, PROST20045700, PROST20 050390, PROST20065100, PROST20073280, PROST20082430, 
PROST20084680, PROST20084720, PROST20099090, PROST20 105450, PROST201 06060, PROST201 08850, 
PROST20110120, PROST20114100, PROST201 46590, PROST2015251 0, PROST201 68600, PUAEN1 0000870, 

40 SKMUS20006790, SKMUS20020770, SKMUS20073150, SKMUS20091900, SKNMC20006350, SKNSH20094350, 
SMINT20006090, SMINT20008110, SMINT20024140, SMINT20028840, SMINT20045470, SMINT20077960, 
SMINT20081330, SMINT20086250, SMINT20088440, SMINT20088690, SMINT200921 60, SPLEN2001 51 00, 
SPLEN20017610, SPLEN20017810, SPLEN20024190, SPLEN20024620, SPLEN20054500, SPLEN20058180, 
SPLEN20063890, SPLEN20073880, SPLEN20080070, SPLEN20090880, SPLEN20101950, SPLEN201 04690, 

45 SPLEN20105100, SPLEN201 08000, SPLEN20110180, SPLEN20110860, SPLEN201 18050, SPLEN20121790, 
SPLEN201 25230, SPLEN201 36700, SPLEN201 38600, SPLEN20139100, SPLEN201 75920, SPLEN201 77400, 
SPLEN201 82850, SPLEN201 83020, SPLEN201 83950, SPLEN201 90080, SPLEN201 90770, SPLEN201 93230, 
SPLEN201 93490, SPLEN20 193790, SPLEN20201830, SPLEN20204670, TESOP1 0000350, TESTI1 00001 90, 
TESTI20006160, TESTI20029100, TESTI20031310, TESTI20032770, TESTI20038240, TESTI20043130, 

50 TESTI20043220, TESTI20045390, TESTI20046540, TESTI20046870, TESTI20047370, TESTI20050400, 
TESTI20051 200, TESTI20051 730, TESTI20053260, TESTI20053780, TESTI20057200, TESTI20057590, 
TESTI20059080, TESTI20061 200, TESTI200621 20, TESTI20063330, TESTI2006341 0, TESTI20063600, 
TESTI20066330, TESTI20068530, TESTI20070400, TESTI20070740, TESTI20073460, TESTI20086840, 
TESTI20095200, TESTI20095440, TESTI20095880, TESTI201 00090, TESTI201 02390, TESTI201 0591 0, 

55 TESTI20113940, TESTI20116120, TESTI20121040, TESTI20121710, TESTI20131440, TESTI20 142540, 
TESTI201 49880, TESTI20151800, TESTI20 162780, TESTI201701 70, TESTI20 173050, TESTI201 82760, 
TESTI201 83680, TESTI201 84750, TESTI201 861 1 0, TESTI201 98540, TESTI201 991 1 0, TESTI20202830, 
TESTI20204260, TESTI2021 0030, TESTI2021 4630, TESTI2021 9110, TESTI20244730, TESTI20245600, 
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TESTI20245860, TESTI2024641 0, TESTI20251 61 0, TESTI2025791 0, TESTI20260640, TESTI20261 040, 
TESTI20262150, TESTI20262940, TESTI2026491 0, TESTI20271 790, TESTI20278280, TESTI20282420, 
TESTI20282900, TESTI20286590, THYMU20007020, THYMU20012020, THYMU200 17270, THYMU20020800, 
THYMU20025480, THYMU20028150, THYMU20030690, THYMU20034790, THYMU20046350, THYMU20046770, 

5 THYMU20050010, THYMU20052830, THYMU20054800, THYMU20055740, THYMU20055760, THY MU200 62770, 
THYMU20078240, THYMU20079690, THYMU20083390, THYMU20087270, THYMU20 100940, THYMU201 15380, 
THYMU201 37050, THYMU201 37570, THYMU201 43230, THYMU20150190, THYMU201 53210, THYMU20 154790, 
THYMU201 63600, THYMU20171580, THYMU20 178440, THYMU201 85470, TRACH20011010, TRACH20011540, 
TRACH20021380, TRACH20073990, TRACH20081270, TRACH20090060, TRACH201 49720, TRACH201 49740, 

10 TRACH201 59390, TRACH201 63470, TRACH20 165330, TRACH201 67090, TRACH201 73680, TRACH201 90460, 
UMVEN 10001 380, UTERU20035770, UTERU20040150, UTERU20045200, UTERU20064120, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20089300, UTERU20089620, UTERU20095100, UTERU20099040, 
UTERU201 03200, UTERU201 25810, UTERU201 27030, UTERU20127150, UTERU201 39760, UTERU20 188840 
[0067] The following 1 21 clones presumably belong to glycoprotein-related proteins. 

15 ADRGL20020290, ADRGL20036840, ADRGL20059610, ADRGL20066770, ASTRO20055570, BNGH420046790, 
BNGH420077980, BRACE20051930, BRACE20069000, BRACE20204670, BRACE20216950, B RAM Y200 13670, 
BRAMY20089770, BRAMY20251210, BRAWH20039640, BRCAN 10000760, BRCAN20005230, BRCOC20003600, 
CD34C20001750, CTONG20017490, CTONG20036990, CTONG20045500, CTONG20059130, CTONG20079590, 
CTONG20085210, CTONG20 184830, DFNES20018000, DFNES20080880, FCBBF30004340, FCBBF30029250, 

20 FCBBF30062490, FCBBF30091520, FCBBF301 6451 0, FCBBF30171230, FCBBF301 95690, FCBBF30223210, 
FEBRA20038220, HCHON20015050, HLUNG20015070, HLUNG20032460, HLUNG20037160, HLUNG20041540, 
KIDNE20142900, KIDNE20169180, KIDNE201861 70, KIDNE201 89960, MESAN20021220, MESAN20058110, 
NT2NE20064780, NT2NE20140130, NT2NE20 155650, NT2RP70056690, NTONG20053630, OCBBF20000740, 
OCBBF20012520, OCBBF20110210, OCBBF201 20010, OCBBF201 65900, OCBBF201 65910, OCBBF20191950, 

25 PEBLM20001800, PLACE60004260, PLACE60087680, PLACE601 13340, P LAC E601 84870, PROST20033240, 
PROST20099090, PROST201 08850, PROST20 146590, SKMUS20073150, SKNMC20006350, SMINT20028840, 
SMINT20056230, SMINT20083290, SMINT20091190, SPLEN20024620, SPLEN20063890, SPLEN20080070, 
SPLEN20090880, SPLEN201 18050, SPLEN20139100, SPLEN201 83020, SPLEN20201830, TEST1 100001 90, 
TESTI2003131 0, TESTI20043990, TESTI20045390, TESTI20051 200, TESTI20057590, TESTI20059080, 

30 TESTI20066330, TESTI20086840, TESTI201 00090, TESTI201 05910, TESTI201 54370, TESTI201 64210, 
TESTI201 82760, TESTI201 84750, TESTI201 991 1 0, TESTI2021 9110, TESTI20220230, TESTI20245600, 
TESTI20251610, TESTI20257910, TESTI20286590, THYMU20024500, THYMU20028150, THYMU20052830, 
THYMU20062770, THYMU20099060, THYMU201 70080, THYMU201 78440, TRACH2001101 0, TRACH20011540, 
TRACH20021380, TRACH201 49740, TRACH201 70860, TRACH201 90460, UTERU20086530, UTERU20087070, 

35 UTERU201 27030 

[0068] The following 88 clones presumably belong to signal transduction-related proteins. 

ASTRO20050810, ASTRO20052420, ASTRO20085080, ASTRO20090680, BNGH420008150, BNGH4200 15760, 
BNGH420035290, BNGH420086030, BRAMY20035830, BRAMY20043630, BRAMY2011 8490, BRAMY20206340, 
BRAMY20244490, BRAMY20251210, BRAMY2 0263 000, BRAWH20093040, B R AW H20 190550, CTONG20004520, 

40 CTONG20029030, CTONG20030280, CTONG20063930, CTONG20070720, CTONG20 189000, FCBBF30001100, 
FCBBF30076310, FCBBF301 00080, FCBBF301 43550, FCBBF30153170, FCBBF301 75350, FCBBF30250980, 
FEBRA20090160, FEBRA201 73330, HCHON20000870, HLUNG20011260, HLUNG20084790, KIDNE20089870, 
KIDNE201 60360, LIVE R2001 1640, MESAN20021130, MESAN20027240, MESAN20065990, NT2NE2001 8890, 
NT2NE20042550, NT2RP70075800, NTONG20043080, NTONG20048440, PLACE60071800, PROST20033240, 

45 PROST20052850, PROST20065790, PROST20075280, SKNSH20052400, SKNSH20057920, SMINT20006020, 
SMINT20035050, SPLEN20023540, SPLEN20039180, SPLEN20048800, SPLEN20049840, SPLEN20054160, 
SPLEN20085910, SPLEN20191020, SPLEN201 98390, TESTI20046490, TESTI20049060, TESTI20053070, 
TESTI20066650, TESTI20081 890, TESTI 20095770, TESTI201 06820, TESTI201 1 2860, TESTI201 45780, 
TESTI20150420, TESTI201 68880, TESTI20205250, TESTI20228120, TESTI 20244220, TESTI20244460, 

50 TESTI20251740, TESTI202611 60, TESTI 20264530, THYMU20013250, THYMU20039320, THYMU201 06990, 
THYMU201 45990, THYMU201 70080, THYMU201 76010, TRACH20188350 
[0069] The following 1 43 clones presumably belong to transcription-related proteins. 

ASTRO20038400, ASTRO20075150, BNGH420070370, BNGH420074600, BNGH420087430, BRACE20003310, 
BRACE20061620, BRAMY20001510, BRAMY20040580, BRAMY20076100, BRAMY20111 780, BRAWH20040680, 
55 BRAWH20050740, BRAWH20080580, BRAWH20082920, BRAWH20095900, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG200 74000, CTONG200 84660, CTONG201 86370, 
CTONG201 86520, DFNES20028170, DFNES20046840, DFNES20073320, FCBBF3000361 0, FCBBF3001 9140, 
FCBBF30021900, FCBBF30093170, FCBBF301 14850, FCBBF30 129010, FCBBF301 36230, FCBBF301 43550, 
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FCBBF30220050, FCBBF30228940, FCBBF30263080, FCBBF30285930, FCBBF500 03530, FEBRA20026820, 
FEBRA20027070, FEBRA20046510, FEBRA20057010, FEBRA20063720, FEBRA20 170240, HCHON1 00001 50, 
HCHON20002650, HEART20019310, HLUNG20014590, HLUNG20028110, HLUNG2006370 KIDNE201 40870, 
LIVER20006260, MESAN20016270, MESAN20038520, NT2NE20038870, NT2NE20053950, NT2NE20060750, 

5 NT2NE20061030, NT2NE20079670, NT2NE20082600, NT2RP70001120, NT2RP70029780, NT2RP70046410, 
NT2RP70057500, NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF201 20950, OCBBF20121910, 
OCBBF201 56450. OCBBF20 157970, OCBBF201 66900, OCBBF201 75360, OCBBF201 77540, PEBLM20003260, 
PLACE60052940, PLACE60066970, PLACE601 22970, PLACE601 50510, P LAC E60 177880, PROST20007170, 
PROST20024250, PROST20035170, PROST20 127450, PROST20151370, PROST201 55370, PUAEN1 0000650, 

10 PUAEN20003120, SMINT20011950, SMINT20026200, SMINT20030740, SMINT20039050, SMINT20044140, 
SMINT20086720, SPLEN20042200, SPLEN20043680, SPLEN20055600, SPLEN20059270, SPLEN20063250, 
SPLEN20098030, SPLEN201 97930, TESTI1 0001 570, TESTI20057430, TESTI20057840, TESTI2005981 0, 
TESTI20067480, TESTI20068790, TESTI 20075240, TESTI20079220, TESTI20088840, TESTI201 04090, 
TESTI201 22070, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI 201 84760, TESTI201 94880, 

15 TESTI201 97600, TESTI20228740, TESTI20254030, TESTI20254990, TESTI20266050, TESTI20274960, 
TESTI20282530, THYMU 10004280, THYMU20019260, THYMU20032820, THYMU20071120, THYMU20077250, 
TKIDN1 0001 920, UTERU20016580, UTERU20026620, UTERU20041630, UTERU20094830, UTERU20099510, 
UTERU20101150, UTERU201 69020, UTERU20177150, UTERU201 85220, UTERU20188670 
[0070] The following 331 clones presumably belong to disease-related proteins. 

20 ADRGL20020290, ADRGL20021 910, ADRGL20026790, ADRGL20036840, ADRGL2005961 0, ADRGL20066770, 
ASTRO20038400, ASTRO20052420. ASTRO20055570, ASTRO200751 50, ASTRO20088950, BNGH420008150, 
BNGH420086030, BRACE1 0000510, BRACE20003310, BRACE20069000, BRACE20097540, BRACE201 94670, 
BRACE20196180, BRACE20204670, BRACE20216950, BRAMY20003540, BRAMY20005080, BRAMY20035830, 
BRAMY20040580, BRAMY20043630, BRAMY20044920, BRAMY20051820, BRAMY20056620, BARMY20089770, 

25 BRAMY20111780, BRAMY201 5251 0, BRAMY201 90550, B RAM Y2022 1600, BRAMY20227860, BRAMY20274510, 
BRAWH20082920, BRAWH20093040, BRAWH20095900, BRAWH20 190530, BRAWH20191980, BR CAN 10000760, 
BRCAN1 0001 050, BRCAN20005230, BRSSN20066440, CTONG20004520, CTONG20029030, CTONG20042640, 
CTONG20045500, CTONG20052780, CTONG20053990, CTONG20070780, CTONG20070910, CTONG20072930, 
CTONG20083980, CTONG20084660, CTONG201 65750, CTONG20 169040, CTONG201 83430, CTONG201 83830, 

30 CTONG201 86290, CTONG201 89000, DFNES200 16470, DFNES20025500, DFNES20046840, DFNES20055400, 
DFNES20080880, FCBBF1 0000230, FCBBF20035490, FCBBF20066340, FCBBF30002270, FCBBF30002280, 
FCBBF30019140, FCBBF30053300, FCBBF30071500, FCBBF30072440, FCBBF3007631 0, FCBBF30080730, 
FCBBF301 00080, FCBBF301 15920, FCBBF301 18670, FCBBF30 129010, FCBBF301 32050, FCBBF301 36230, 
FCBBF30153170, FCBBF301 64510, FCBBF301 66220, FCBBF301 71230, FCBBF30 175350, FCBBF30 194550, 

35 FCBBF30220050, FCBBF30223210, FCBBF30259050, FCBBF30263080, FCBBF30275590, FCBBF50001650, 
FEBRA20027070, FEBRA20045380, FEBRA20046200, FEBRA20046510, FEBRA2005701 0, FEBRA20063720, 
FEBRA20078800, FEBRA20087550, FEBRA20088810, FEBRA20090160, FEBRA20092760, FEBRA20151750, 
FEBRA201 70240, FEBRA201 73330, FEBRA20191720, HCHON1 00001 50, HCHON200 15050, HEART20009590, 
HEART20022200, HEART20063100, HHDPC20081230, HLUNG20008460, HLUNG20014590, HLUNG20032460, 

40 HLUNG20063700, HLUNG20065990, HLUNG20069350, HLUNG20081530, HLUNG20082350, HLUNG20083330, 
HLUNG20085210, KIDNE20081170, KIDNE20084040, KIDNE20088240, KIDNE20089870, KIDNE201 33460, 
KIDNE201 34890, KIDNE20141700, KIDNE201 42900, KIDNE201 50730, KIDNE20 152440, KIDNE201 60360, 
KIDNE201 65390, KIDNE201 69180, KIDNE20 173430, KIDNE201 89960, LIVER20026440, MESAN20006200, 
MESAN20021130, MESAN20033220, MESAN20056890, MESAN20057240, MESAN20065990, MESAN20067430, 

45 MESAN20069530, NESOP20004520, NT2NE200 18890, NT2NE20026200, NT2NE20037050, NT2NE20053950, 
NT2NE20061030, NT2NE20111190, NT2NE201 17580, NT2NE201 19980, NT2NE20140130, NT2NE20141040, 
NT2RI20093010, NT2RP70003110, NT2RP70046410, NT2RP70075300, NTONG20032100, NTONG20034540, 
OCBBF20000740, OCBBF20012520, OCBBF20111600, OCBBF201 20010, OCBBF201 56450, OCBBF201 57970, 
OCBBF20191950, PEBLM20001800, PEBLM20003260, PLACE60004260, PLACE6001262 PLACE60054230, 

50 PLACE60054870, PLACE60062660, PLACE60087680, PLACE60 184870, PROST20015210, PROST20024250, 
PROST20036350, PROST20050390, PROST20058860, PROST20063430, PROST20065790, PROST20084720, 
PROST20099090, PROST20 120070, PROST20 127450, PROST201 46590, PROST201 52510, PROST201 68600, 
PUAEN 10000650, PUAEN20003120, SKMUS20008730, SKMUS20017400, SKMUS20040440, SKMUS20073590, 
SKMUS20079150, SKNSH20009710, SMINT20002320, SMINT20007470, SMINT2000811 0, SMINT20011950, 

55 SMINT20016150, SMINT20026200, SMINT20030740, SMINT20049920, SMINT20077960, SMINT20083290, 
SMINT20086250, SMINT20089600, SMINT20091190, SPLEN20023540, SPLEN200241 90, SPLEN20042200, 
SPLEN20043680, SPLEN20055600, SPLEN20057830, SPLEN20059270, SPLEN20063890, SPLEN20073500, 
SPLEN20080070, SPLEN20085910, SPLEN20090880, SPLEN20098030, SPLEN201 18050, SPLEN201 36730, 
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SPLEN201 38600, SPLEN20139100, SPLEN201 39360, SPLEN201 80980, SPLEN201 87490, SPLEN201 93790, 
SPLEN20201 830, TESTI1 00001 90, TESTI20031 310, TESTI20035790, TESTI20041 630, TESTI20049060, 
TESTI20050720, TESTI20051 200, TESTI20057430, TESTI20057590, TESTI20059080, TESTI200621 20, 
TESTI20067480, TESTI20071 630, TESTI20099350, TESTI201 051 30, TESTI201 0591 0, TESTI201 08060, 

5 TESTI201 25920, TESTI201 30530, TESTI20131440, TESTI201 34680, TESTI 20 142540, TESTI20143180, 
TESTI201 50420, TESTI201 54370, TESTI201 6421 0, TESTI201 66670, TESTI201 68880, TESTI201 71 070, 
TESTI201 82760, TESTI201 84750, TESTI201 93080, TESTI201 94880, TESTI201 96970, TESTI201 97600, 
TESTI20201 760, TESTI202071 70, TESTI2021 91 1 0, TESTI20228740, TESTI20244430, TESTI20246480, 
TESTI20251 740, TESTI20252690, TESTI20254030, TESTI2025791 0, TESTI20258720, TESTI20266050, 

10 TESTI20271790, TESTI20274960, TESTI20282530, TESTI20286590, THYMU 10004280, THYMU20006020, 
THYMU20013250, THYMU20019260, THYMU20023560, THYMU20028150, THYMU20032820, THYMU20034400, 
THYMU20055460, THYMU20063650, THYMU20070250, THYMU20071120, THYMU20081110, THYMU20090230, 
THYMU20095920, THYMU20098350, THYMU20099060, THYMU201 20730, THYMU20121040, THYMU201 70080, 
THYMU201 85650, THYMU20191970, TKIDN 10000620, TKIDN1 0001 920, TRACH20011540, TRACH20091070, 

15 TRACH201 43710, TRACH201 70860, UTERU1 0001 060, UTERU20026620, UTERU20041630, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20099510, UTERU20101150, UTERU201 04310, UTERU201 27030, 
UTERU201 85220 

[0071] The following 21 9 clones presumably belong to the category of enzymes and/or metabolism-related proteins. 
ADRGL20059610, ASTRO20026320, ASTRO20050810, ASTRO20088950, BNGH420008150, BNGH420035290, 

20 BNGH420074600, BRACE20050870, BRACE20097540, BRACE20200770, BRACE20204670, BRACE20215410, 
BRAMY20003540, BRAMY20005080, BRAMY20027990, BRAMY20028620, BRAMY20044920, BRAMY20055760, 
BRAMY20056620, BRAMY20072870, BRAMY20093490, BRAMY20096930, BRAMY20118490, BRAMY201 25360, 
BRAMY201 43870, BRAMY20152510, BRAMY20231150, BRAMY20244490, BRAMY20251210, BRAWH20021 910, 
BRAWH20082920, BRAWH20093040, BRAWH20094900, BRAWH20183170, BRAWH201 88750, BRAWH201 90550, 

25 BRAWH20191980, BRCAN20005230, BRCOC20003600, CTONG20051100, CTONG20070910, CTONG20076810, 
CTONG20079590, CTONG20080140, CTONG20085210, CTONG201 86290, DFNES20063460, DFNES20080880, 
FCBBF20023490, FCBBF20066340, FCBBF30004340, FCBBF30019140, FCBBF30022680, FCBBF30029250, 
FCBBF30072440, FCBBF30076310, FCBBF30085560, FCBBF30091520, FCBBF30 107290, FCBBF301 25880, 
FCBBF301 32050, FCBBF30 143550, FCBBF30153170, FCBBF301 66220, FCBBF30171230, FCBBF301 75350, 

30 FCBBF30236670, FCBBF30260480, FEBRA20038220, FEBRA20040560, FEBRA20078800, FEBRA20090160, 
FEBRA201 72230, FEBRA201 73330, HCHON20000870. HCHON20002710, HEART1 0001 490, HEART20022200, 
HEART20047640, HEART20082570, HLUNG20011260, HLUNG20032460, HLUNG20041540, HLUNG20042730, 
HLUNG20054790, KIDNE20080690, KIDNE20083620, KIDNE20084040, KIDNE20147170, KIDNE201 52440, 
KIDNE20173150, KIDNE201861 70, KIDNE201 89960, LIVER20011 640, LIVER20026440, LIVER20055270, 

35 MESAN20021130, MESAN20033220, MESAN20038520, MESAN20057240, MESAN20058110, MESAN20065990, 
MESAN20095800, NT2NE20026200, NT2NE20042550, NT2NE2011 7580, NT2NE201 27900, NT2RI20093010, 
NT2RP70064570, NTONG20 034540, NTONG20043080, NTONG20053630, NTONG20053730, NTONG20058010, 
OCBBF201 20010, OCBBF201 67290, OCBBF20191950, PANCR1 0000860, PLACE60052940, PLACE60064180, 
PLACE60073090, PLACE60095600, PLACE601 84410, PLACE601 88630, PROST20007600, PROST20033240, 

^0 PROST20036350, PROST20 039300, PROST20050390, PROST20051310, PROST20052850, PROST20065790, 
PROST20075280, PROST20084720, PROST20099090, PROST201 08850, PROST201 52510, PUAEN20001520, 
PUAEN20002470, SKNMC20006350, SKNSH20057920, SMINT20008110, SMINT20049920, SMINT20094680, 
SPLEN20023540, SPLEN20024930, SPLEN20043680, SPLEN20048800, SPLEN20054500, SPLEN20057900, 
SPLEN20071820, SPLEN20080070, SPLEN20085910, SPLEN201 08000, SPLEN201 36730, SPLEN201 80980, 

45 TESTI20012080, TESTI20030200, TESTI20031310, TESTI20038240, TESTI20050720, TESTI20051200, 
TESTI20059080, TESTI200621 20, TESTI20066330, TESTI20076570, TESTI201 03690, TESTI201 051 30, 
TESTI201 06820, TESTI201 08060, TESTI201 12860, TESTI20121040, TESTI20 130530, TESTI20131440, 
TESTI201 68880, TESTI201 701 70, TESTI201 96690, TESTI201 96970, TESTI201 991 1 0, TESTI20205250, 
TESTI20212970, TESTI20222030, TESTI20226520, TESTI20227380, TESTI 20244460, TESTI20244730, 

50 TESTI20250630, TESTI20260640, TESTI20262940, TESTI 20264530, TESTI20285230, THYMU20006020, 
THYMU20013250, THYMU20034400, THYMU20039320, THYMU20055460, THYMU20055760, THYMU20063650, 
THYMU20066660, THYMU20070250, THYMU20087270, THYMU20096580, THYMU20 100940, THYMU201 10720, 
THYMU201 20240, THYMU201 20730, THYMU201 70230, TRACH20011010, TRACH20021380, TRACH20091070, 
TRACH201 13020, TRACH20 143710, TRACH201 64100, TRACH201 90460, UTERU20087070, UTERU20089620, 

55 UTERU201 0431 0, UTERU201 85220, UTERU201 88670 

[0072] The following 44 clones presumably belong to the category of cell division- and/or cell proliferation-related 
proteins. 

ASTRO20090680, BRACE20079370, BRAMY20234820, BRCAN1 0001 050, BRCAN20005410, CTONG20032930, 
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FCBBF20070950, FCBBF30002270, FCBBF30053300, FCBBF301 05860, FCBBF30 175350, FCBBF3021 5240, 
FCBBF30275590, FEBRA20045380, HLUNG20068120, KIDNE20 134890, KIDNE201 50730, MESAN20021 470, 
NT2NE20077250, NT2NE20 153620, NT2RP70030840, NTONG20053910, OCBBF20111370, OCBBF201 74580, 
PROST20063430, SKNMC1 0001 230, SMINT20028800, SPLEN20023540, SPLEN20057830, SPLEN201 39360, 
5 TESTI20031410, TESTI20057840, TESTI20065650, TESTI20066650, TESTI201 07320, TESTI201 08060, 
TESTI201 1 4480, TESTI201 34680, TESTI201 431 8 TESTI201 50920, TESTI20201 760, TESTI20278280, 
TESTI20284260, THYMU20097920 

[0073] The following 80 clones presumably belong to the category of cytoskeleton -related proteins. 
ADRGL20062330, ASTR020053430, BGGI1 20000670, BRACE20079370, BRAMY20038980, BRAMY20083330, 

10 BRAMY20094890, CTONG20004110, CTONG20032930, CTONG20077760, CTONG20083980, CTONG201 69040, 
CTONG201 83430, DFNES20018000, FCBBF301 05860, FCBBF301 30410, FCBBF30 194550, FCBBF30201630, 
FCBBF30271 990, FEBRA20005040, FEBRA20046200, FEBRA20099860, HCHON20015050, HLUNG20081530, 
KIDNE20081170, NT2RP70001730, NT2RP70003110, NTONG20032100, OCBBF201 66890, OCBBF201 74890, 
PLACE60054870, PLACE60055590, PLACE60071800, PLACE601 18810, PROST20015210, PROST20097840, 

15 PROST201 20070, PROST20 146590, SKMUS20007260, SKMUS20008730, SKMUS2001 7400, SKMUS20073590, 
SMINT20062050, SMINT20074330, SMINT20077960, SPLEN20039180, SPLEN20049840, SPLEN20076470, 
SPLEN201 82990, SPLEN201 87490, SPLEN201 95710, TESTI1 00001 90, TESTI20041 630, TESTI20057880, 
TESTI20058920, TESTI20060080, TESTI20064530, TESTI20064650, TESTI20065650, TESTI20067440, 
TESTI20071 1 30, TESTI20099350, TESTI201 1 2540, TESTI201 25280, TESTI201 3601 0, TESTI201 5331 0, 

20 TESTI201 75370, TESTI 20222460, TESTI20244430, TESTI20254030, TESTI20258720, THYMU20024500, 
THYMU20062610, THYMU20098350, TRACH20043360, TRACH20098510, TRACH20 149500, UTERU20089390, 
UTERU201 22520, UTERU201 68960 

[0074] The following 70 clones presumably belong to the category of nuclear proteins and/or RNA synthesis- related 
proteins. 

25 ASTRO20026320, BRACE20050870, BRACE20200770, BRAMY201 34050, BRAWH20063010, BRAWH20093040, 
BRAWH201 74330, BRAWH20 176850, CTONG20042640, FCBBF20023490, FCBBF20035490, FCBBF20070950, 
FCBBF30002270, FCBBF30048420, FCBBF30080730, FCBBF301 15920, FCBBF30236670, FEBRA20035240, 
FEBRA20092760, FEBRA201 73330, HHDPC20081230, HLUNG20011460, HLUNG20068120, KIDNE20089870, 
KIDNE201 50730, MESAN20056890, MESAN20057240, NT2NE20037050, NT2NE20 167660, NT2RP70031070, 

30 NTONG20053730, PLACE600641 80, PLACE60095600, PROST20016760, PROST20051310, PROST20058860, 
PROST201 52510, PUAEN20002470, SKMUS20079150, SKNSH20030640, SPLEN20023850, SPLEN20057830, 
SPLEN201 39360, SPLEN201 90430, TESTI20006830, TESTI20030200, TESTI20031410, TESTI20035790, 
TESTI200621 20, TESTI20065650, TESTI20081 890, TESTI201 50920, TESTI201 5331 0, TESTI2020 1 760, 
TESTI20212970, TESTI20227380, TESTI20251 740, TESTI20256560, TESTI20260640, TESTI202701 30, 

35 TESTI20284260, TESTI20285230, THYMU20021090, THYMU 20049060, THYMU20066660, THYMU20081110, 
THYMU20090230, THYMU201 20240, UTERU 10001 060, UTERU201 04310 

[0075] The following 20 clones presumably belong to the category of protein synthesis- and/or protein transport- 
related proteins. 

BRAMY20038980, BRAMY20274510, CTONG20008190, CTONG20033610, FCBBF20018680, FEBRA20090220, 
40 KIDNE20141700, NT2NE201 67660, NTONG20055200, PLACE60012620, PROST20036350, PROST20062820, 
SKMUS20040440, SMINT20000070, SPLEN201 80980, TESTI20055680, TESTI 20067440, TESTI201 07240, 
THYMU20096580, THYMU20121040 

[0076] The following 1 0 clones presumably belong to the category of cellular defense-related proteins. 
ASTRO20089600, BRAMY201 17670, FEBRA20087550, HLUNG20081 390, MESAN20057240, NTONG20031 580, 
45 PROST20007600, SPLEN20023850, SPLEN20043680, TESTI20261680 

[0077] The following 1 9 clones presumably belong to the category of development and/or differentiation-related pro- 
teins. 

BRACE20061620, BRACE20200770, BRAMY20013670, CTONG2001 7490, CTONG20020950, HCHON1 0000150, 
MESAN20021470, OCBBF201 65910, PROST201 55370, PUAEN20002470, TESTI20079220, TESTI20079980, 
50 TESTI201 66670, TESTI20 184760, TESTI20252690, TRACH20040390, UTERU20089620, UTERU20094830, 
UTERU201 69020 

[0078] The following 1 68 clones presumably belong to the category of DNA- and/or RNA-binding proteins. 
ASTRO20038400, BGG1 12001 0750, BNGH420070370, BRACE20003310, BRACE20061620, BRAMY20001510, 
BRAMY20040580, BRAMY20076100, BRAMY20111780, BRAMY20274510, BRAWH20040680, BRAWH20050740, 
55 BRAWH20063010, BRAWH20080580, BRAWH20095900, BR AW H20 174330, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG20074000, CTONG201 65750, CTONG20 186370, 
CTONG201 86520, DFNES20046840, DFNES20073320, FCBBF20035430, FCBBF20070950, FCBBF30002270, 
FCBBF30003610, FCBBF30019140, FCBBF30021900, FCBBF30048420, FCBBF30080730, FCBBF300931 70, 
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FCBBF301 14850, FCBBF301 29010, FCBBF30 136230, FCBBF30220050, FCBBF30228940, FCBBF30236670, 
FCBBF30263080, FCBBF30285930, FCBBF50003530, FEBRA20026820, FEBRA20027070, FEBRA20035240, 
FEBRA20046510, FEBRA2005701 0, FEBRA20063720, FEBRA20087550, FEBRA20092760, FEBRA20 170240, 
FEBRA201 77800, HCHON20002650, HEART20019310, HEART20063100, HHDPC20081230, HLUNG20011460, 

5 HLUNG20014590, HLUNG20028110, HLUNG20063700, HLUNG20068120, KIDNE201 40870, LIVER20006260, 
MESAN20016270, MESAN20056890, MESAN20057240, NT2NE20038870, NT2NE20053950, NT2NE20060750, 
NT2NE20079670, NT2NE20082600, NT2NE20087270, NT2RP70029780, NT2RP70046410, NT2RP70057500, 
NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF201 20950, OCBBF20121910, OCBBF20 156450, 
OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF20 177540, PEBLM 10001470, PEBLM2Q003260, 

10 PLACE60066970, PLACE601 22970, PLACE601 77880, PROST20007170, PROST20024250, PROST200351 70, 
PROST20051310, PROST20058860, PROST20151370, PROST20 155370, PUAEN20003120, SMINT20011950, 
SMINT20030740, SMINT20039050, SMINT20044140, SMINT20086720, SPLEN20042200, SPLEN20043680, 
SPLEN20055600, SPLEN20059270, SPLEN20063250, SPLEN20139360, SPLEN20 190430, TESTI1 0001 570, 
TESTI20006830, TESTI20030200, TESTI20031410, TESTI20035790, TESTI20057430, TESTI2005981 0, 

15 TESTI20062120, TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, 
TESTI201 04090, TESTI201 34970, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, 
TESTI201 97600, TESTI20201 760, TESTI2021 2970, TESTI20227380, TESTI20228740, TESTI20246480, 
TESTI20254030, TESTI20254990, TESTI20266050. TESTI20268240, TESTI202701 30, TESTI20274960, 
TESTI20282530, TESTI20284260, TESTI20285230, THYMU 10004280, THYMU200 19260, THYMU20023560, 

20 THYMU20032820, THYMU20049060, THYMU20066660, THYMU20071120, THYMU20077250, THYMU20081110, 
THYMU20090230, TKIDN1 0001 920, TRACH201 08240, UTERU1 0001 060, UTERU20026620, UTERU20041630, 
UTERU20094830, UTERU20099510, UTERU201 01150, UTERU201 69020, UTERU20177150, UTERU20188670 
[0079] The following 93 clones presumably belong to the category of ATP- and/or GTP-binding proteins. 
ASTRO20026320, BNGH420035290, BRACE20050870, BRACE20079370, BRACE20200770, BRAMY20055760, 

25 BRAMY201 18490, BRAMY20244490, BRAMY20251210, BRAWH20093040, BRAWH20 190550, BRCAN1 0001 050, 
BRCOC20003600, CTONG20008190, CTONG20030280, CTONG20032930, CTONG20 176040, CTONG20 184830, 
FCBBF20023490, FCBBF30019140, FCBBF30076310, FCBBF301 05860, FCBBF30 175350, FCBBF30201630, 
FCBBF30236670, FEBRA20005040, FEBRA20090160, FEBRA201 73330, HCHON20000870, HLUNG20011260, 
HLUNG20052300, KIDNE200811 70, KIDNE201 34890, LIVER20030650, LIVER20055270, MESAN20065990, 

30 NT2NE20042550, NTONG20043080, NTONG20055200, OCBBF201 82060, PLACE60054870, PLACE60064180, 
PLACE60095600, PLACE 60 140640, PROST20015210, PROST20 033240, PROST20036350, PROST20051310, 
PROST20052850, PROST20062820, PROST20075280, PROST20 120070, PUAEN20002470, SKNSH20052400, 
SKNSH20057920, SMINT20008110, SPLEN20023850, SPLEN20043680, SPLEN20049840, SPLEN201 36730, 
SPLEN201 80980, SPLEN201 93790, TESTI20055680, TESTI20058920, TESTI20060080, TESTI20064650, 

35 TESTI20071130, TESTI20099350, TESTI201 06820, TESTI201 12860, TESTI20 134680, TESTI20 136010, 
TESTI201 431 80, TESTI201 75370, TESTI2021 2970, TESTI20222460, TESTI20227380, TESTI20244220, 
TESTI20244460, TESTI20264530, THYMU20013250, THYMU20039320, THYMU20062610, THYMU20066660, 
THYMU20087270, THYMU20096580, THYMU201 00940, THYMU201 76010, TRACH20043360, TRACH20098510, 
TRACH201 13020, UTERU201 85220, UTERU201 88670 

40 [0080] Among the clones other than the ones shown above, BNGH42003641 0, FCBBF30257370 are clones which 
were predicted to highly possibly belong to the category of secretory protein and/or membrane protein based on the 
result of domain search by Pfam. 
SMINT20044730, TESTI201 40970 

[0081] The two clones shown above are clones which were predicted to highly possibly belong to the category of 

45 glycoprotein-related protein based on the result of domain search by Pfam. 

BRACE20055560, CTONG20046690, DFNES20043710, FCBBF30005500, MESAN20030350, MESAN20030370, 
PLACE60074820, TESTI20058350, TESTI20106170, TRACH201 31230, UTERU20000950 
[0082] The 11 clones shown above are clones which were predicted to highly possibly belong to the category of 
signal transduction-related protein based on the result of domain search by Pfam. 

50 ASTRO20010290, BRACE20099070, CTONG20007660, DFNES20076340, DFNES20094820, FCBBF301 25460, 
FCBBF301 42290, FCBBF301 69280, FEBRA20031000, NT2NE20026510, NT2RP70031340, P LAC E5000 1390, 
SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI20 166290, TESTI20259110, THYMU20 184550 
[0083] The 18 clones shown above are clones which were predicted to highly possibly belong to the category of 
transcription-related protein based on the result of domain search by Pfam. 

55 ADRGL20047770, ADRGL20079060, BRACE200 14450, BRACE20051600, BRAWH 20 185260, CTONG20033750, 
CTONG20070090, CTONG20 190290, FCBBF20020440, FCBBF30005360, FCBBF301 73960, FEBRA20031000, 
KIDNE20087880, LIVER20013890, MES AN 20030350, MESAN20030370, OCBBF20113110, PLACE60074820, 
PLACE60093380, PROST20028970, PROST20102190, SALGL1 0001070, SPLEN20006950, SPLEN20011350, 
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S PLEN20050090, TESTI20060830, TESTI200661 50, TESTI201 20900, TESTI201 3231 0, TESTI201 48380, 
TESTI201 62980, TESTI201 66290, TESTI20205100, THYMU201 12590, TRACH20029880 

[0084] The 35 clones shown above are clones which were predicted to highly possibly belong to the category of 
enzyme and/or metabolism-related protein based on the result of domain search by Pfam. 
5 PLACE60054820, TESTI201 97030 

[0085] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
cell division and/or cell proliferation -related protein based on the result of domain search by Pfam. 
ASTRO20006530, OCBBF2001 6390, TRACH20058000 

[0086] The three clones shown above are clones which were predicted to highly possibly belong to the category of 
10 cytoskeleton -related protein based on the result of domain search by Pfam. 
BRACE20065470, PLACE60054820 

[0087] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
nuclear protein and/or RNA synthesis-related protein based on the result of domain search by Pfam. 
ASTRO20010290, BRACE20099070, BRAWH200 14590, CTONG20007660, DFNES2007634Q, DFNES20094820, 
15 FCBBF301 25460, FCBBF301 42290, FCBBF301 69280, FEBRA20031000, MESAN 20 034440, NT2NE2002651 0, 
NT2RP70031340, PLACE50001390, SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI201 66290, 
TESTI20259110, THYMU20 104480, THYMU201 84550 

[0088] The 21 clones shown above are clones which were predicted to highly possibly belong to the category of 
DNA- and/or RNA-binding protein based on the result of domain search by Pfam. 
20 KIDNE20133880, MESAN20030350, MESAN20030370, TESTI200594B0 

[0089] The four clones shown above are clones which were predicted to highly possibly belong to the category of 
ATP- and/or GTP-binding proteins based on the result of domain search by Pfam. 

[0090] The 205 clones shown below are clones which were unassignable to any of the above-mentioned categories, 
but have been predicted to have some functions based on homology search using their full-length nucleotide sequences 
25 and motif search in their estimated ORFs. Clone Name, Definition in the result of homology search or Motif Name in 
the motif search, demarcated by a double slash mark (//), are shown below. 
ADRGL20022600//DIAPHANOUS PROTEIN HOMOLOG 1 (P140MDIA). 
ADRGL20023920//ABC1 PROTEIN HOMOLOG PRECURSOR. 

ASTR020001910//Rattus norvegicus mRNA for annexin V-binding protein (ABP-10), partial cds. 
30 ASTR020009140//PUTATIVE COMPETENCE-DAMAGE PROTEIN. 
ASTR020046280//PSU1 PROTEIN. 

ASTR020058960//DNA damage inducible protein homolog - fission yeast (Schizosaccharomyces pombe) 
BNGH420024870//C2 domain// C2 domain// C2 domain 
BRACE20007330//RING CANAL PROTEIN (KELCH PROTEIN). 
35 BRACE20052430//Homo sapiens AMSH mRNA, complete cds. 
BRACE20054600//Xenopus laevis mRNA for Kielin, complete cds. 
BRACE2005981 0//TSC-22/dip/bun family 

BRACE20063540//MEROZOITE SURFACE PROTEIN CMZ-8 (FRAGMENT). 
BRACE20079200//Xenopus laevis mRNA for Kielin, complete cds. 
40 BRAMY2001 6780//Proprotein convertase P-domain 
BRAMY20023640//UBX domain 

BRAMY20045420//Domain found in Dishevelled, Egl-10, and Pleckstrin 
BRAMY20056840//UBE-1C2 

BRAMY20063750//Homo sapiens HRIHFB2007 mRNA, partial cds. 
45 BRAMY20102900//Homo sapiens RU1 (RU1) mRNA, complete cds. 
BRAMY20158550//CALMODULIN. 

BRAMY20223010//Mus musculus leucine-rich glioma-inactivated 1 protein precursor, (Lgi1) mRNA, complete cds. 
BRAMY20238630//TETRATRICOPEPTIDE REPEAT PROTEIN 4. 
BRAMY20245760//Araneus diadematus fibroin-4 mRNA, partial cds. 
50 BRAWH20047790//HMG (high mobility group) box 

BRSSN2000561 0//Mus musculus semaphorin cytoplasmic domain -associated protein 3A (Semcap3) mRNA, complete 
cds. 

BRSSN20005660//Bacterial type II secretion system protein 

BRSSN20093890//Homo sapiens mRNA for Kelch motif containing protein, complete cds. 
55 CTONG20041150//Streptomyces ansochromogenes strain 7100 SanE (sanE) gene, complete cds. 
CTONG20066110//Homo sapiens DEME-6 mRNA, partial cds. 
CTONG20069420//Ribosomal protein S14p/S29e 
CTONG20071040//BETA CRYSTALLIN B2 (BP). 
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CTONG20074170//DENN (AEX-3) domain 
CTONG20083430//Nuclear transition protein 2 

CTONG20170940//MYOTROPHIN (V-1 PROTEIN) (GRANULE CELL DIFFERENTIATION PROTEIN). 
CTONG20174290//TRICHOHYALIN. 
s CTONG201 74580//Homo sapiens mRNA for vascular Rab-GAP/TBC-containing protein complete cds. 
CTONG20180690//Collagen triple helix repeat (20 copies) 
CTONG20186550//cca3 protein - rat 
CTONG20188080//TPR Domain 

FCBBF10004760//Homo sapiens GAP-like protein (N61) mRNA, complete cds. 
10 FCBBF20033360//RING CANAL PROTEIN (KELCH PROTEIN). 
FCBBF20041 380//SAM domain (Sterile alpha motif) 
FCBBF20043730//UBA domain 

FCBBF20056580//Mus muscuius NSD1 protein mRNA, complete cds. 
FCBBF20059660//TPR Domain 
15 FCBBF30019180//SERINE/THREONINE PROTEIN PHOSPHATASE 2A, 65 KDA REGULATORY 

SUBUNIT A, ALPHA ISOFORM (PP2A, SUBUNIT A, PR65-ALPHA ISOFORM) (PP2A, SUBUNIT A, R1 -ALPHA ISO- 
FORM). 

FCBBF30026580//Homo sapiens retinoblastoma-associated protein RAP140 mRNA, complete cds. 
FCBBF30035570//C2 domain 
20 FCBBF300797707/D-isomer specific 2-hydroxyacid dehydrogenases 

FCBBF30100120//Mus muscuius semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
cds. 

FCBBF30100410//Mus muscuius testis-specific Y-encoded-like protein (TspyM) mRNA, complete cds. 
FCBBF30118890//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 
25 FCBBF301 38000//trg protein - rat 

FCBBF30157270//Rattus norvegicus PAPIN mRNA, complete cds. 

FCBBF301 61 780//gag gene protein p24 (core nucleocapsid protein)// Zinc knuckle 

FCBBF301 98670//dof protein - fruit fly (Drosophila melanogaster) 

FCBBF30222910//Mus muscuius Rap2 interacting protein 8 (RPIP8) mRNA, complete cds. 
30 FCBBF30255680//Rattus norvegicus brain specific cortactin-binding protein CBP90 mRNA, partial cds. 
FCBBF30260210//Drosophila melanogaster KISMET-L long isoform (kis) mRNA, complete cds. 
FCBBF30282020//cca3 protein - rat 

FCBBF40000610//late gestation lung 2 protein [Rattus norvegicus]. 

FEBRA20029620//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat 
35 FEBRA20031150//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 

FEBRA20038330//Corticotropin-releasing factor family 

FEBRA20038970//Homo sapiens mRNA for stabilin-1 (stabl gene). 

FEBRA20088610//CELLULAR RETINALDEHYDE-BINDING PROTEIN (CRALBP). 

FEBRA201 50420//HYPOTHETICAL 131.5 KDA PROTEIN C02F12.7 IN CHROMOSOME X. 
40 FEBRA201 75330//D- isomer specific 2-hydroxyacid dehydrogenases 

HEART10001420//Mus muscuius skm-BOP1 (Bop) mRNA, complete cds. 

HLUNG20024050//Rubredoxin 

HLUNG20030420//Mus muscuius mRNA for MAIL, complete cds. 
HLUNG20030490//Ambystoma tigrinum RPE65 protein mRNA, complete cds. 
45 HLUNG20033060//Homo sapiens GAP-like protein (N61 ) mRNA, complete cds. 

HLUNG20041590//ubiquitous tetratricopeptide containing protein RoXaN [Homo sapiens]. 
HLUNG20045340//MOB2 PROTEIN (MPS1 BINDER 2). 
HLUNG20051330//FHIPEP family 

HLUNG20070410//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
50 HLUNG200721 00//Gallus gallus Dach2 protein (Dach2) mRNA, complete cds. 

HLUNG20083480//Chicken mRNA for TSC-22 variant, complete cds, clone SLFEST52. 
KIDNE20027980//SAM domain (Sterile alpha motif) 

KIDNE20084730//Homo sapiens FH1/FH2 domain-containing protein FHOS (FHOS) mRNA, complete cds. 
KIDNE20149780//NG28 [Mus muscuius] 
55 KIDNE20154330//Rattus norvegicus mRNA for multi PDZ domain protein. 

KIDNE20170400//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat// Protein kinase C terminal do- 
main// Rubredoxin 

KIDNE20189890//Homo sapiens mRNA for KARP-1 -binding protein 2 (KAB2), complete cds. 
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LIVER20010760//Homo sapiens C-type lectin-like receptor-1 mRNA, complete cds. 
LIVER20040740//RETINAL-BINDING PROTEIN (RALBP). 
MESAN20009090//Homo sapiens CEGP1 protein (CEGP1), mRNA 
MESAN20026870//PAN domain// TBC domain 
5 MESAN20090190//CEGP1 protein [Homo sapiens]. 

NT2NE20059680//Homo sapiens integrin cytoplasmic domain associated protein (lcap-1a) mRNA, complete cds. 
NT2NE20077270//Adenovirus EB1 55K protein / large t-antigen 

NT2NE20087850//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT). 
NT2NE20095230//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 
10 NT2NE20108420//KES1 PROTEIN. 

NT2NE20173970//Rattus norvegicus beta-catenin binding protein mRNA, complete cds. 
NT2NE201 77210//Leishmania major partial ppgl gene for proteophosphogtycan. 
NT2RP7001 2830//CALPHOTIN. 

NT2RP70035110//Caenorhabditis elegans UNC-89 (unc-89) gene, complete cds. 
15 NTONG20002230//Mus musculus RW1 protein mRNA, complete cds. 

NTONG20005310//Ribosomal protein S9/S16 

NTONG20029850//C ALC Y PHOS I N E (R2D5 ANTIGEN). 

NTONG20035150//RING CANAL PROTEIN (KELCH PROTEIN). 

NTONG20058220//Homo sapiens phosphoprotein pp75 mRNA, partial cds. 
20 OCBBF20005220//Rattus norvegicus Fos-related antigen mRNA, complete cds. 

OCBBF20011860//Mus musculus epithelial protein lost in neoplasm-a (Eplin) mRNA, complete cds. 

OCBBF20016810//enhancer of polycomb [Mus musculus] 

OCBBF20147070//DNA polymerase (viral) C-terminal domain 

OCBBF20160380//liver stage antigen LSA-1 - Plasmodium falciparum OCBBF20177910//Corticotropin-releasing fac- 
25 tor family 

PEBLM20005020/A/irion host shutoff protein 

PLACE60055460//Homo sapiens leucine-zipper protein FKSG13 (FKSG13) mRNA, complete cds. 

PLACE60068710//SUPPRE PROTEIN SRP40. 

PLACE60080360//mucin [Homo sapiens] 
30 PLACE60082850//Pathogenesis-related protein Bet v I family 

PLACE60098350//Human hepatocellular carcinoma associated protein (JCL-1) mRNA, complete cds. 

PLACE60105680//Homo sapiens mRNA for TU12B1-TY, complete cds. 

PLACE601 1 9700//Homo sapiens mRNA for ABP32, complete cds. 

PU\CE60120280//SER/THR-RICH PROTEIN T10 IN DGCR REGION. 
35 PLACE60132200//TRICHOHYALIN. 

PLACE60181870//Pentaxin family 

PROST20084470//Plasmodium berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds. 
PROST20087240//gag gene protein p24 (core nucleocapsid protein) 
PROST20122490//Gallus gallus syndesmos mRNA, complete cds. 
40 PROST20130320//S-100/ICaBP type calcium binding domain 
PROST20152870//Homo sapiens APC2 gene, exon 14. 

PUAEN10001640//Mus musculus cerebellar postnatal development protein-1 (Cpd1) mRNA, partial cds. 

PUAEN20000800//Bleomycin resistance protein 

SMINT20012220//Collagen triple helix repeat (20 copies) 
45 SMINT20035510//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 

SMINT20036440//Drosophila melanogaster epsin-like protein mRNA, complete cds. 

SMINT20038660//Homo sapiens HNOEL-iso (HNOEL-iso) mRNA, complete cds. 

SMINT20043390//Ras association (RalGDS/AF-6) domain 

SMINT20048720//Cytochrome P450// Cytochrome P450 
50 SMINT200521 30//Rattus norvegicus mRNA for gankyrin homologue, complete cds. 

SMINT20054050//ABC1 PROTEIN HOMOLOG PRECURSOR. 

SPLEN20024770//Rattus norvegicus (rsec6) mRNA, complete cds. 

SPLEN20040780//CORNIFIN B (SMALL PROLINE-RICH PROTEIN 1B) (SPR1B) (SPR1 B). 
SPLEN20041810//BC-2 protein [Homo sapiens] 
55 SPLEN20100040//258.1 KDA PROTEIN C210RF5 (KIAA0933). 
SPLEN20104150//Ribosomal protein L36 
SPLEN20116720//Homo sapiens misato mRNA, partial cds. 
SPLEN201 761307/Homo sapiens mRNA for ALEX1 , complete cds. 
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SPLEN201 81 570//TRICHOHYALIN. \ 
TESTI2000431 0//TRICHOHYALIN. 
TESTI20016970//TPR Domain 
TESTI20030440//TRICHOHYALIN. 
5 TESTI200431 80//mouse mRNA for megakaryocyte potentiating factor, complete cds. 

TESTI20043910//IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmod- 
ulin-binding motif// IQ calmodulin-binding motif 

TESTI20044900//Strongylocentrotus purpuratus radial spokehead mRNA, complete cds. 
TESTI20646110//Extracellular link domain 
10 TESTI20047930//Homo sapiens NY-REN-2 antigen mRNA, complete cds. 
TESTI20049410//Proprotein convertase P-domain 
TESTI20053950//IQ calmodulin-binding motif 

TESTI20054700//Streptbcoccus pneumoniae strain g375 surface protein PspC (pspC) gene, pspC-8. 1 allele, complete 
cds. 

15 TESTI20055880//Serum amyloid A protein 

TESTI20056030//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds. 
TESTI20061090//Keratin, high sulfur B2 protein 

TESTI20064370//TPR Domain// TPR Domain// TPR Domain// TPR Domain// Synaptobrevin 
TESTI20084250//OXYSTEROL-BINDING PROTEIN. 
20 TESTI200921 70//ENV polyprotein (coat polyprotein) 
TESTI20116050//UBX domain 
TESTI20120500//Kelch motif// Kelch motif 

TESTI20126280//Mus musculus STAP mRNA for sperm tail associated protein, complete cds. 

TESTI20144390//TESTIS-SPECIFIC PROTEIN PBS13. 
25 TESTI201 65990//Ribosomal protein L36 

TESTI20169500//HYPOTHETICAL51.9 KDA PROTEIN C27F1.04C IN CHROMOSOME I. 

TESTI201 70280//Flagellar L-ring protein 

TESTI20176450//thioredoxin interacting factor [Mus musculus]. 

TEST120179230//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
so TESTI20180600//Homo sapiens HOM-TES-85 tumor antigen mRNA, complete cds. 

TESTI20209050//HYPOTHETICAL 113.1 KDA PROTEIN IN PRE5-FET4 INTERGENIC REGION. 

TESTI20210570//RETINAL-BINDING PROTEIN (RALBP). 

TESTI20215310//Homo sapiens calcyclin binding protein mRNA, complete cds. 

TESTI20247440//Human BLu protein testis isoform (BLu) mRNA, complete cds. 
35 TESTI20249360//Homo sapiens DEME-6 mRNA, partial cds. 

TESTI20250220//TRICHOHYALIN. 

TESTI20251440//Rattus norvegicus (rsec6) mRNA, complete cds. 
TESTI20255460//Mus musculus mRNA for MIWI (piwi), complete cds. 
THYMU20009500//TPR Domain 
40 THYMU20010180//MOB1 PROTEIN (MPS1 BINDER 1). 

THYMU20013810//Human SEC7 homolog Tic (TIC) mRNA, complete cds. 
THYMU20018250//TPR Domain 

THYMU20026950//Mus musculus ROSA 26 transcription AS ROSA26AS mRNA, complete cds. 
THYMU20028410//Mus musculus Pax transcription activation domain interacting protein PTIP mRNA, complete cds. 
45 THYMU20030460//Homo sapiens tumor endothelial marker 7 precursor (TEM7) mRNA, complete cds. 
THYMU20031330//Homo sapiens putative nucleotide binding protein mRNA, complete cds. 
THYMU20052460//PHORBOLIN I (FRAGMENTS). 
THYMU20055450//Zona pellucida-like domain 

THYMU20083830//Homo sapiens angiostatin binding protein 1 mRNA, complete cds. 
50 THYMU20139160//Uncharacterized protein family UPF0031 

THYMU20151610//Homo sapiens antigen NY-CO-1 (NY-CO-1) mRNA, complete cds. 

TRACH20093400//TRICHOHYALIN. 

TRACH20104510//Uncharacterized protein family UPF0005 

TRACH201 229807/HYPOTHETICAL PROTEIN MJ0798. 
55 TRACH201 39280//PX domain 

TRACH20164810//D-isomer specific 2-hydroxyacid dehydrogenases 

TRACH20165540//Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta 01 type II deletion. 
UTERU20051790//guany!ate kinase-interacting protein 1 Maguin-1, membrane-associated - rat 



60 



EP 1 308 459 A2 



UTERU20083020//Domain of unknown function DUF71 
UTERU201 21 1 40//Rhodanese-like domain 

UTERU20128560//26.4 KDA PROTEIN IN RUVC-ASPS INTERGENIC REGION. 
UTERU20132620//AXONEME-ASSOCIATED PROTEIN MST101(2). 
5 UTERU20134830//pellino (Drosophila) homolog 2 [Homo sapiens] 
UTERU20181270//Zinc knuckle 

[0091] Further, the reason is that a polypeptide does not always belong solely to a single category of the above- 
described functional categories, and therefore, a polypeptide may belong to any of the predicted functional categories. 
Besides, additional functions can be found for the clones classified into these functional categories by further analyses. 
10 [0092] Since the polypeptide encoded by clones of the invention contains full-length amino acid sequence, it is pos- 
sible to analyze its biological activity, and its effect on cellular conditions such as cell proliferation and differentiation 
by expressing the polypeptide as a recombinant polypeptide using an appropriate expression system, injecting the 
recombinant into the cell, or raising a specific antibody against the polypeptide. 

[0093] The biological activities of respective polypeptides can be analyzed by the methods as shown below. 

15 

Secretory protein, transmembrane protein: 
[0094] 

20 "ion Channels" (Ed., R. H. Ashley, 1995) of "The Practical Approach Series" (IRL PRESS), 

"Growth Factors" (Eds., I. McKay, I. Leigh, 1993), 
"Extracellular Matrix" (Eds., M. A. Haralson, J. R. Hassell, 1995); 

Glycoprotein-related protein: 

25 

[0095] 

"Glycobiology" (Eds., M. Fukuda, A. Kobata, 1993) of "The Practical Approach Series" (IRL PRESS), 
"Glycoprotein Analysis in Biomedicine" (Ed., Elizabeth F.Hounsell, 1993) of "Method in Molecular Biology" (Hu- 
so mana Press) series; 

Signal transduction-related protein: 

[0096] 

35 

"Signal Transduction" (Ed., G. Milligan, 1992) of "The Practical Approach Series" (IRL PRESS), 
"Protein Phosphorylation" (Ed., D. G. Hardie, 1993), or 

"Signal Transduction Protocols" (Eds., David A. Kendall, Stephen J.Hill, 1995) of "Method in Molecular Biology" 
(Humana Press) series; 

40 

Transcription-related protein: 
[0097] 

45 "Gene Transcription" (Eds., B. D. Hames, S. J. Higgins, 1993) of "The Practical Approach Series" (IRL PRESS), 

"Transcription Factors" (Ed., D.S.Latchman, 1993); Enzyme and/or metabolism-related protein: 
"Enzyme Assays" (Eds., ROBERT EISENTHAL and MICHAEL J. DANSON, 1992) of "The Practical Approach 
Series" (IRL PRESS); Cell division and/or cell proliferation-related protein: 

"Cell Growth, Differentiation and Senescence" (Ed., GEORGE STUDZINSKI, 2000) of 'The Practical Approach 
50 Series" (IRL PRESS); 

Cytoskeleton -related protein: 

[0098] 

55 

"Cytoskeleton: Signalling and Cell Regulation" (Eds., KERMIT L. CARRAWAY and CAROLIE A. CAROTHERS 
CARRAWAY, 2000) of "The Practical Approach Series" (IRL PRESS), 

"Cytoskeleton Methods and Protocols" (Ed., Gavin, Ray H., 2000) of "Method in Molecular Biology" (Humana 
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Press) series; Nuclear protein and/or RNA synthesis-related protein: 

"Nuclear Receptors" (Ed., DIDIER PICARD, 1999) of "The Practical Approach Series" (IRL PRESS), 
"RNA Processing" (Eds., STEPHEN J. HIGGINS and B. DAVID HAMES, 1994); 

5 Protein synthesis and/or transport-related protein : 

[0099] 

"Membrane Transport" (Ed., STEPHEN A. BALDWIN, 2000) of "The Practical Approach Series" (IRL PRESS) 
10 "Protein Synthesis Methods and Protocols" (Eds., Martin, Robin, 1998) of "Method in Molecular Biology" (Humana 

Press) series; 

Cellular defense-related protein: 

15 [0100] 

"DNA Repair Protocols" (Henderson, Daryl S., 1999) of "Method in Molecular Biology" (Humana Press) series, 
"Chaperonin Protocols" (Eds., Schneider, Christine, 2000); Development and/or differentiation -related protein: 
"Developmental Biology Protocols" (Eds., ROBERT EISENTHAL and MICHAEL J. DANSON, 1992) of "Method in 
20 Molecular Biology" (Humana Press) series; 

DNA- and/or RNA-binding protein: 

[0101] 

25 

"DNA-Protein Interactions Principles and Protocols" (Eds., Kneale, G. Geoff, 1994) of "Method in Molecular Biol- 
ogy" (Humana Press) series, 

"RNA-Protein Interaction Protocols" (Eds., Haynes, Susan R., 1999); 
30 ATP- and/or GTP-binding protein: 
[0102] 

"Signal Transduction Protocols" (Eds., David A. Kendall, Stephen J. Hill, 1995) of "Method in Molecular Biology" 
35 (Humana Press) series. 

[0103] In the categorization, the clone predicted to belong to the category of secretory and/or membrane protein 
means a clone having hit data with some annotation, such as growth factor, cytokine, hormone, signal, transmembrane, 
membrane, extracellular matrix, receptor, G-protein coupled receptor, ionic channel, voltage-gated channel, calcium 
40 channel, cell adhesion, collagen, connective tissue, etc., suggesting that it was a secretory or membrane protein, or 
a clone in which the presence of nucleotide sequence encoding a signal sequence or transmembrane region was 
suggested by the results of PSORT and SOSUI analyses for deduced ORF. 

[0104] The clone predicted to belong to the category of glycoprotein-related protein means a clone having hit data 
with some annotation, such as glycoprotein, suggesting that the clone encodes a glycoprotein-related protein. 
45 [01 05] The clone predicted to belong to the category of signal transduction-related protein means a clone having hit 
data with some annotation, suchasserine/threonine-protein kinase, tyrosine-protetn kinase, SH3 domain, SH2 domain, 
etc., suggesting that the clone encodes a signal transduction-related protein. 

[0106] The clone predicted to belong to the category of transcription-related protein means a clone having hit data 
with some annotation, such as transcription regulation, zinc finger, homeobox, etc. , suggesting that the clone encodes 

so a transcription-related protein. 

[0107] The clone predicted to belong to the category of disease-related protein means a clone having hit data with 
some annotation, such as disease mutation, syndrome, etc., suggesting that the clone encodes a disease-related 
protein, or a clone whose full-length nucleotide sequence has hit data for Swiss-Prot, GenBank, UniGene, or nr, where 
the hit data corresponds to genes or polypeptides which have been deposited in the Online Mendelian Inheritance in 

55 Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is the human gene and disease database described later. 
[0108] The clone predicted to belong to the category of enzyme and/or metabolism-related protein means a clone 
having hit data with some annotation, such as metabolism, oxidoreductase, E. C. No. (Enzyme commission number), 
etc., suggesting that the clone encodes an enzyme and/or metabolism-related protein. 
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[0109] The clone predicted to belong to the category of cell division and/or cell proliferation-related protein means 
a clone having hit data with some annotation, such as cell division, cell cycle, mitosis, chromosomal protein, cell growth, 
apoptosis, etc., suggesting that the clone encodes a cell division and/or cell proliferation-related protein. 
[0110] The clone predicted to belong to the category of cytoskeleton -related protein means a clone having hit data 
5 with some annotation, such as structural protein, cytoskeleton, actin-binding, microtubles, etc., suggesting that the 
clone encodes a cytoskeleton-related protein. 

[0111] The clone predicted to belong to the category of nuclear protein and/or RNA synthesis- related protein means 
a clone having hit data with some annotation, such as nuclear protein, RNA splicing, RNA processing, RNA helicase, 
polyadenylation, etc., suggesting that the clone encodes a nuclear protein and/or RNA synthesis-related protein. 
10 [0112] The clone predicted to belong to the category of protein synthesis and/or transport- related protein means a 
clone having hit data with some annotation, such as translation regulation, protein biosynthesis, amino-acid biosyn- 
thesis, ribosomal protein, protein transport, signal recognition particle, etc., suggesting that the clone encodes a protein 
synthesis and/or transport-related protein. 

[0113] The clone predicted to belong to the category of cellular defense-related protein means a clone having hit 
15 data with some annotation, such as heat shock, DNA repair, DNA damage, etc., suggesting that the clone encodes a 
cellular defense-related protein. 

[0114] The clone predicted to belong to the category of development and/or differentiation-related proteins means 
a clone having hit data with some annotation, such as developmental protein, etc., suggesting that the clone encodes 
a development and/or differentiation-related protein. 
20 [0115] The clone predicted to belong to the category of DNA-and/or RNA-binding protein means a clone having hit 
data with some annotation, such as DNA-binding, RNA-binding, etc. 

[0116] The clone predicted to belong to the category of ATP-and/or GTP-binding protein means a clone having hit 
data with some annotation, such as ATP-binding, GTP-binding, etc. 

[0117] As to a protein involved in a disease, it is possible to perform a functional analysis as described above, but 
25 also possible to analyze correlation between the expression or the activity of the protein and a certain disease by using 
a specific antibody that is obtained by using expressed protein. Alternatively, it is possible to utilize the database OMIM, 
which is a database of human genes and diseases, to analyze the protein. Further, new information is constantly being 
deposited in the OMIM database. Therefore, it is possible for one skilled in the art to find a new relationship between 
a particular disease and a gene of the present invention in the most up-to-date database. The proteins involved in 
30 diseases are useful for developing a diagnostic marker or medicines for regulation of their expression and activity, or 
as a target of gene therapy. 

[0118] Also, as for a secretory protein, membrane protein, signal transduction-related protein, glycoprotein -related 
protein, or transcription- related protein, etc., search of the OMIM with the following keywords resulted in the finding 
that the proteins are involved in many diseases (the result of the OMIM search for secrete and membrane proteins is 

35 shown below). Also, association between proteins related to signal transduction or transcription and diseases is re- 
ported in "Transcription Factor Research-1 999" (Fujii, Tamura, Morohashi, Kageyama, and Satake edit, (1999) Jikken- 
Igaku Zoukan, Vol.17, No.3), and "Gene Medicine" (1999) Vol.3, No.2). When cancer is used as an example, as de- 
scribed in "Biology of Cancer" (S. Matsubara, 1992) of Life Science series (Shokabo), many proteins are involved in 
cancers, which include enzyme and/or metabolism-related proteins, cytoskeleton-related proteins, cell division and/or 

40 cell proliferation- related proteins as well as secretory proteins, membrane proteins, signal transduction-related proteins, 
glycoprotein-related proteins, transcription-related proteins. As clearly seen by the above example, it is evident that 
not only disease-related proteins but also secretory proteins, membrane proteins, signal transduction-related proteins, 
glycoprotein-related proteins, transcription-related proteins, etc. are often involved in diseases, and thus they can be 
useful targets in the field of medical industry. 

45 [0119] The result of the OMIM search for secretory and membrane proteins is shown below, in which the keywords, 



(1) secretion protein, 

(2) membrane protein, 

(3) channel, and 

(4) extracellular matrix were used. 



[0120] Shown in the search result are only the accession numbers in the OMIM. Using the number, data showing 
the relationship between a disease and a gene or protein can be seen. The OMIM data has been renewed everyday. 

1) Secretion protein 

354 entries found, searching for "secretion protein" 

*604667, *104760, *176860, *151675, *139320, *107400, *604029, M18910, #200100, M76880, *603850, 
*1 47572, *604028, M 7951 3, *1 25950, *1 39250, *246700, *600946, *600560, *602926, 1 85860, *605083, *60321 5, 
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*602421, '157147, '179512, *600174, *109270, '604710, *138120, *179510, *600998, '179509, *170280, 
*179511, '600626, '603831 , '601489, '154545, '179490, 

*603826, '122559, '603216, *1 02720, '147290, '164160, '603062, '112262, '602672, '605435, '605322, 
•131230, '601652, '603166, *601746, *601591, '179508, #160900, '104311, '600759, '147545, '167805, 
5 #104300, '167770, #219700, '168470, *601684, '602049, '601146, '605227, '602434, '602534, '114840, 

*603489, *604323, M 07470, *600753, *600768, *118825, *600564, 

•604252, *173120, '134370, *192340, *308230, *600322, *605359, *600046, *300090, 106160, *600041, 
#262500, *605563, *1 50390, *158106, *1 82590, #103580, •104610, #173900, *1 34797, *1 43890, #145980, 
•306900, '308700, *1 76300, *227500, *1 37350, #154700, *1 38079, *600760, *1 07730, *1 42410, *1 47670, 

10 *1 24092, *590050, •152760, *600509, *605646, *201910, *227600, 

*1 52790, '300200, *300300, 300800, *1 381 60, *1 07741 , *1 201 50, *601 1 99, *1 201 80, *1 201 60, *1 76730, *1 331 70, 
•122560, *1 07300, *1 37241, *120140, *101000, *1 93400, *217000, *272800, *600937, #201710, *600377, 
#174800, *106100, #274600, *1 73350, #177170, *1 47620, *214500, *131244, *202110, *120120, *601007, 
•191160, *1 47470, *603372, *600733, *252800, M90160, 

15 *138040, *158070, M62151, #125700, #130070, '113811, *603355, *171060, *136435, #184700, •603732, 

•190180, *1 64008, *1 86590, *1 20220, *604312, *1 52200, *138130, *605085, *605353, *600840, #166210, 
•188545, *207750, *173360, *601933, #194050, '153450, '138850, '253200, '307030, '157145, '600514, 
'600262, *264080, '147380, '600281 , #204000, #227810, '232200, 

•188826, *232800, M61561, #166200, M88400, *153620, *182099, *218040, #265800, *172400, #177200, 
20 •176805, #211600, #214700, #176410, M52780, *600633, *601771, *301500, *605402, '601922, '307800, 

'147892, '147720, '312060, #520000, '147660, '106150, '602358, '107270, '601769, '147440, '604558, 
'131530, '600270, '601610, '603692, '603401/600423, '601604, 

•603345, #125853, *602843, *142640, *603044, *605740, M34830, *602779, *130660, *139191, *137035, 
•600761, *601340, *600823, *107740, M30160, *600877, •605110, *600945, *130080, •600957, #130050, 
25 •605580, *1 1 8444, *601 1 24, *1 24020, 1 22470, *1 20700, *603201 , *1 3721 6, *601 1 85, *1 38945, '21 8030, '600839, 

#240600, #262400, #162300, '162330, '188450, #265850, 

'263200, '162641, *300159, *601038, #191390, *201810, *601398, *602384, *131240, *602423, *139392, 
•142703, *602663, *232700, *602682, #602722, *602730, '600734, '188540, '182452, '601538, '60306.1, 
'146880, '603140, '603160, '142704, #252650, '182280, '125255, '603252, #131750, '182139, '182100, 

30 #259420, #261100, '603493, '601745, '182098, '603795, '123812, 

'600264, '147940, M80246, *180245, M18888, #604284, M68450, *118455, *604398, *604433, *601919, 
•118445, *600031, *604961, *605032, *605033, *171050, #171300, *131243, *109160, '605254, 274900, 
#171400, *600042, '151670, '184600, '605470, '605546, '176760, '602008, '102200, '605720, '600732, 
'605901 

35 2) Membrane protein 

1489 entries found, searching for "membrane protein" 

'130500, '605704, '305360, '153330, '173610, '109270, '170995, '170993, *1 04776, *602333, *309060, 
•605703, '120920, '605943, '602690, '159430, '600897, '133090, '601178, '602413, '602003, '604405, 
'605940, '603237, '109280, '600378, '602173, '107776, '602334, '602335, '125305, '601134, '309845, 

40 *605731, *1 54045, *603241 , *603718, *600594, *603214, *185881, 

•603657, *600182, *603177, *605331, *601476, *605456, *601114, *605190, *600723, *603904, *136950, 
•300222, *602879, *1 85880, *605348, *300096, *602257, *1 77070, *310200, *603062, *603344, *600039, 
•602977, *300100, *1 28240, *600959, *600322, *227400, *1 86945, *600946, *602534, *602048, *1 82900, 
•601097, *600267, *602625, *1 36430, *602421 , •601047, *1 07450, 

45 *143450, *603141, *184756, *164730, M59440, *154050, *600579, '312080, '604202, '603700, '600447, 

'256540, '604691, '158343, '600403, '602414, '137290, '176640, '176981, '600179, '600754, '604456, 
'604693, '605875, '604605, '188860, *300172, *602910, *604323, *219800, *601848, *603179, *600279, 
•602251 , #222700, *603831, *605072, *605377, *601028, *604155, 

•108733, *104225, *601896, *601510, M73335, *107770, '601767, '600046, '603850, '600040, '603784, 
50 *603234, 188560, *605863, *121015, *605862, *605861 , *1 86946, *604252, *603215, *1 42461 , *604597, *6031 43, 

•605264, *603735, '176860, '605536, '176801, '180721, '603355, '104760, '131560, '310300, '602631, 
'304700, #309400, '603142, *143890, *605431 , *600753, 

•115501, *1 76790, *600266, *601691, M68468, *601239, *602216, #104300, ^OSeiS, *601595, *605550, 
•125950, *605475, *602217, *602261, *603534, *602262, *604631, M90315, ^OISIS, *604306, '104311, 
55 *604672, *605000, '602461, '605548, '602296, '604376, '121014, '121011, '600691, '604262, '139310, 

'304040, '605445, '179514, '179512, '151460, #160900, '120130, 

*1 28239, *601 1 58, ^601 403, *1 76943, *601 0 1 4, 300800, *300294, *601 757, *1 85470, *273800, *605034, *602887, 
#185000, '604871, *603593, *603583, *605454, *1 04775, *605872, M41180, *602713, '603531, '139150, 
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'601531, '601832, '605452, '134651, '604156, '120620, '605883, '604142, '166945, '605324, '600816, 
*604699, '300112, '605182, '600164, '182180, '605071, 

'300023, '605057, '308240, *300249, '176947, *1 76894, '605081, '605035, '602044, *1 82860, '107271, 
*305100, '153390, '113730, '602689, '180069, '603518, '300017, '191275, '177061, '601693, '601789, 
5 '604241, '600934, '138160, '604424, '603868, '600174, '600718, '600523, '604141, '601009, '605251, 

'600481, '600874, '155550, '605227, '601017, '162230, 601138, 

'6041 57, '601 21 2, '600763, '6041 1 0, '6041 58, '601 1 07, '601 326, 600621 , '600587, 601 1 37, '60091 7, '600855, 
'605058, '194355, '605194, '603291, '102720, '136425, '170715, '603216, '605547, '135630, '602926, 
'600168, '605002, '602474, '600157, '603025, '603893, '231200, '120090, '601966, '131230, '604722, 

10 '604721 , '604515, '246700, '602101 , '605628, '303630, 

'605787, '602857, '602285, '605708, '602488, '605025, *60381 7, '300051 , '603293, '176878, '603646, 605707, 
185860, '112205, '300187, '602654, '120070, '603648, '604850, '602655, '602514, '300118, '182309, '179590, 
'602701, '600759, '204200, '604170, '175100, #103580, '147670, '306400, '143100, '182870, '257220, 
'180380,. #11 6920, '301000, '193300, '157147, 

15 '131550, '139200, '139130, '190195, '605406, '155760, '155960, '605734, '155970, '605385, '111700, 

'155975, '150370, 605709, '151430, '605438, '151510, '116952, '157655, '158105, '605777, '176877, '153619, 
'120131, '185430, '109190, '120190, '109170, '605093, '605250, '153432, '107777, '186590, '160993, 
'605699, '605698, '605813, '605697, '605616, '605300, 

'162060, '605219, '163970, '135620, '165040, '605478, '604964, '103195, '604932, '604923, '605906, 
20 '605496, '60591 4, *1 66490, 1 38277, '60491 5, *1 1 4070, '60521 3, '605933, *1 80297, *1 01 000, *1 91 1 63, *1 91 1 64, 

'605101, '603167, '600772, '603164, '600708, '604001, '191328, '313440, '602672, '604009, '604299, 
'192974, '604256, '603048, '600515, '604221 , '602632, 

'604196, '601179, 603290, '604661 , '601023, '601110, '304800, '203200, '300212, '602933, '603352, '208900, 
'604418, '604838, '600551, #212140, '604837, '602049, '600552, '600553, '300213, '602574, '600583, 
25 '600932, '603452, '604775, '516020, '604617, '604464, '603498, '300145, '601523, '602694, '600632, 

'604762, '604492, '40001 5, '604504, '601 71 7, '601 728, 

'300242, '602426, '604194, '603821, '604730, '600695, '603823, '603869, '300241, '600707, '603822, 
'602370, '602202, '604193, '601181, '604089, '602507, '604195, '602306, '300284, '601805, '601895, 
'601275, '604660, '600752, '603820, '604192, '602207, '308230, '600894, '312600, '603199, '604029, 

30 '602500, '102680, '235200, #256300, '601633, #219700, 262890, 

M56225, '173470, '193400, '173910, '600354, '113705, '600065, '107741, '107400, '600024, '131195, 
'113811, #118220, '601638, '300011, '276903, '604144, '311770, '601758, #173900, '604592, '120120, 
'179605, '603130, '603372, '110750, '222900, '602509, '256100, '602469, '602281, '229300, '224100, 
'110900, '190180, '261600, '602997, '603616, '603189, 601791, 

35 '601567, '312700, '171060, '308700, '604027, '162643, '516000, '176261, '604028, '314850, #145980, 

'601383, '600930, '305900, '601253, '136350, '605537, '138140, '604033, '605070, '139250, '300500, 
'603967, '300041, '603866, #130600, '120150, '601050, '604942, '605204, '605248, '272750, '600163, 
'604235, '600682, '107266, '306900, '191092, #262500, '600106, 

'152790, '186720, '227650, '153700, '308380, '103390, '605646, '164920, '604478, #252650, '173850, 
40 '173350, '602505, '246530, '194380, '602575, '603030, #209920, '212138, #214100, '605767, '600582, 

'189980, #176200, '604653, '604678, '256550, '300037, '253700, #253300, #226700, '604766, #244400, 
'190000, '188040, '604824, *214500, #237500, '232300, '605014, 

'604477, '190930, '605124, '604475, '604594, #227810, '306700, #301050, '600135, '600143, '605145, 
#269920, '300104, '277900, '300135, '300231, '192500, '182138, '191190, '176805, '600185, '186591, 
*s '604889, '603051, '165360, '147545, '601040, #156575, '107269, '603009, '602934, '123825, '601081, 

'602924, '163890, '600381 , '602909, '150330, '109690, '123900, 

'603434, '603491, '110700, '602581, '125647, #154700, '114760, '141900, '603690, '120220, '601199, 
#145500, '601309, '602382, '120325, '600877, '604205, '604090, '601497, '602377, '605464, '138720, 
'603728, '120950, '604026, '600580, '601610, '137167, '603960, '603931, '601880, '603126, '138190, 

50 '130130, '601997, '601975, '600395, '516040, '600418, '600650, 

'605245, '605172, '600509, '164761, '310400, '600308, '605109, '600544, '600359, '600103, '605267, 
'312610, '176100, '308100, '158070, '605123, '173325, #312750, '600839, '158120, #604369, '604465, 
'173510, #161200, '151525, '605369, '604237, '516050, #600886, '604517, '165180, '605381, '605399, 
'307800, '604365, '155740, '147795, 601709, '604673, '147730, 

55 '602122, '147557, '193245, '600978, '604990, '603261, '603274, '601007, '131100, '602941, '107941, 

'146710, '276901, '131244, '602872, '603411, '186357, '176290, '601066, '185050, '232200, '143030, 
'601843, #236700, '604122, '142800, '134638, '604985, '182380, '603930, '142410, '137060, '604586, 
'6011 93, *1 20650, '252500, '253800, '120930, '604858, '605874, 
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601274, '602158, '605873, '193210, '203100, '601295, '604095, #201710, '126150, '108740, #205400, 
'601373, '300167, '109545, '602894, '603361, #300257, '266200, '603401, '131390, '180470, '605908, 
•604798, #221770, '223360, *1 80901, '605641, '605745, '604018, '300200, '604603, '230800, '602676, 
#604004, '605692, '602640, '601599, '134637, '245900, '118425, 
5 601 61 4, '605725, *1 201 1 0, *3001 89, '300035, '6031 02, '250800, '602282, '602458, *1 2361 0, '603754, '300278, 

'601463, '300224, '601581, '182160, '601653, '139191, '601733, '600748, '142460, '601194, '152390, 
'153620, '601615, '601814, '601617, '601613, '300191, #308300, '600798, 601858, '601872, '601597, 
#601588, '600821, '147840, *1 52427, *1 38850, '600823, 

*601492, *300256, *600840, *300267, *601411 , *1 39080, *1 39090, 600851 , *300334, *1 79080, *602095, *601284, 
10 *601282, #177200, *601681, *601252, *176000, *602184, *602188, #266510, #154020, *186711, *257200, 

•601711, *600667, •602241, *1 86745, *255125, *300126, *600644, *1 23890, #255120, #175200, '600004, 
•302060, *123580, *186760, *122561 , *602316, *600017, 

*1 20940, 1 40300, *1 51 690, *1 20700, *602354, *60001 9, *600857, *1 821 75, *600536, *1 58380, *60051 6, *1 20290, 
•600493, *182310, #252010, M82530, # 186830, *601839, *142790, *159465, *118990, *250790, *248600, 
15 #248250, *186845, *601153, *142600, M16930, *114860, *171834, #303600, *1 86880, *600444, *1 42871, 

•601852, *602602, *602607, *114207, •186910, #232220, 

600880, *1 34635/112203, #112100, # 111680, *231680, *311030, *111250, M11200, *134390, #226670, #145600, 
•226200, *602714, *171760, *1 33550, *602727, *161555, *602744, *602746, #131705, *602835, *600423, 
•176267, *602859, #600918, 277175, *602874, *601020, *109770, *600170, *217070, *173515, *602893, 

20 *147280, *154360, *171050, *108780, *176257, *600979, 

•600377, *108360, *204500, *170260, •146880, *154582, -601011, *600997, *602992, *201475, *603005, 
•190198, *147360, #270400, *600238, #164970, *306250, #126600, *193065, #181350, *106180, *602136, 
•600937, *603086, *603087, *307030, *1 82099, *1 03320, *601683, #192430, M 031 80, *1 02681, *1 92321, 
•600244, *1 91 740, *191315, *603152, *1 02642, *191305, #266140, 

25 *1 00500, *600867, *604585, *604404, *604345, *603201, *605430, *603207, *603208, *605433, *604101, 

•603969, *605896, *604616, *605851, *605768, *604576, *605754, *605730, *605477, *603263, *605538, 
•603283, *604402, *605453, *605427, *603302, *605458, 603313, *60441 5, *603345, *605541 , *603353, *605295, 
•603879, *605268, *605266, *605246, *603377, *603380, 

•605181, *604203, •603425, •603867, *605106, *605017, *603842, *604936, *603510, *604857, *605932, 
30 •605816, *603765, *603551, *605357, *605237, *604204, *603594, *605110, *604190, *603861, *604962, 

•603639, *603644, *605007, *605349, *604943, *60491.8, *604907, *603667, *603681, *605396, *605561, 
•603712, *603713, *605688, *605942, *604878, *604843, *604659, 

•604671, *603798, *604682, •604056, *604705, *603749, 602586, *603647, *602515, #602475, *603717, 
•602359, *602372, *602380, *60251 8, *603652, *602573, *603626, 602587, *603598, *602871 , *60361 3, •603750, 
35 *603875, *602608, *602666, *602345, *602935, *603564, *603548, *603927, 601 876, *602343, *603943, *603787, 

•601730, *601611, *602679, *603788, *602243, 

603790, *601535, *603796, *601488, *601485, *602314, *601478, *604047, *604048, *602297, *604057, *602715, 
•602192, *601459, *601416, *603833, *602190, *604102, *602106, *604111, *602724, *603499, *602736, 
•601123, *601002, *600923, *601987, *604149, *601929, *600910, *600900, *600864, *604165, *600782, 

^0 *602836, *600769, *600742, *602783, *601905, *600535, 

•604198, *601901, *600534, *602876, *603356, *600530, *604216, *604217, *602890, *602905, *600465, 
•600464, *600446, *602891, *603366, *601894, *604272, •603926, *603312, *600368, *602914, *600327, 
•603151, *603202, 602911, *602974, *603006, *601883, *603008, *600074, *603007, *603046, #603903, 
•604433, *600016, *603925, *516005, *516004, *516003, *601756, 

45 *604487, *516001, *313475, *313470, #307810, *604527, *604528, *601745, *604551, *604555, *603243, 

•603242, *603061, *603063, *603217, *300335, *300283, *300281, *604600, *300197, *603097, *603220, 
•601625, *604623, *603118, *601590, *604646, *300008, *601568, *300007, *275630, *601533, #275200, 
•270200, #261550, *604031 , *604683, #254800, *251100, #242300, 

•604058, *604720, *240500, *233690, #232240, #226730, *223100, *222100, #220100, '216950, *604832, 
50 212750, 212067, *604066, *1 93067, 601315, *1 93001 , *604862, *604870, *191306, •600385, *604879, *191191 , 

•601296, *604914, *190181, *604119, #188550, *604925, *1 88410, #601287, *604939, *1 88380, *604126, 
•604945, *6041 48, *1 88060, *604982, *1 86854, *604988, 

M86360, *1 86355, *1 85250, *600916, *605008, *605009, 1 85020, *600734, *605024, *1 82331 , *605032, *605033, 
•182305, *1 80903, #179800, *1 79610, *605060, *1 79410, *1 78990, *1 76802, *605080, •176266, *1 76263, 
55 *1 76260, •600732, *1 73490, *6041 99, *1 73445, *1 73391 , 1 72290, *6051 47, *6051 49,*171 890, *600528, *1 71 833, 

•605185, #170500, *605193, #168000, *605196, 

M67055, *605205, *605208, 166900, *605216, '162651, '162010, *600504, #161400, *604253, #160800, 
•159460, *154540, *605254, *605261, *153634, *600429, *153337, '600424, '605292, #604286, #152700, 
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152423/152310, '151625, '600153, '604313, '151523, '150325, '150320, '150292, '603150, '150290, '150210, 
*605410, '605415, '605416, '605417, '605421 , '603149, 

'604349, '147940, '600282, '147880, '146928, '146661, '600150, '146630, '142622, '600018, '605461, 
'138981, '138590, '600023, '138330, '605495, '138297, '605512, '138230, #136900, #301310, '516006, 
5 '605545, '605546, '136131, '134660, '134350, '516002, '605589, '131235, #130050, '605625, '126455, 

'126064, #125310, '605670, '604534, '125240, '123836, '123830, 

'123620, '605702, #122200, '120980, '120360, '118510, '114835, '605710, '605716, '605722, '114217, 
'604561, '113810, '111740, #110800, '605748, '605752, '604564, '110600, '603160, '109610, '605784, 
#107480, '107273, '603192, '300169, '106195, '105210, '104615, '104614, '104210, '103850, 103581, 
10 '605876, '605877, '605879, '103220, '605887, '300150, '102910, 

'102670, '102576, '605916, '604629, '102575, '102573, '300132, '101800, '605947 
3) Channel (member of membrane protein) 
361 entries found, searching for "channel" 

'176266, '600724, '182390, '123825, '114208, '114206, '176267, '114205, '601784, '600937, '114204, 
15 '603415, '600053, '114207, '114209, '605427, '604527, '604528, '600760, '601011, '192500, '118425, 

'600228, '176261, '602235, '600761, '600359, '300008, '182389, '600877, '602232, '176263, '182391, 
'601328, '600054, '603939, '602208, '601534, '600504, '602323, 

'603208, '601958, '603537, '601012, '601327, '600734, '602780, '602781, '604433, '603220, '182392, 
'605874, '605873, '601745, '603888, '603219, '602604, '603796, '302910, '602866, '601013, '602905, 
20 '602906, '603967, '600163, #170500, '152427, '180901, '176260, #601462, '603951, '601141, '604492, 

'600702, '602023, '600308, '602754, '107776, '176257, '602024, 

'601949, '605222, '601142, '602983, '193245, '600681, '176265, '600235, '176262, '176258, '605206, 
'604427, '605411, '603305, '601219, '600150, '604065, '602343, '605223, '605720, '603906, '138249, 
'138253, '600843, '604385, '600003, '600935, '603940, '602727, '602158, 602911 , '600397, '602726, '600845, 

25 *605080, '600580, '602872, '6021 06, *1 76264, '603953, 

'605722, '3001 1 0, *1 38252, '6041 1 1 , '60271 7, '602420, '600570, 600844, '603493, '600932, '60571 6, *1 38254, 
'603652, '300138, '605410, '176268, '605214, '605696, '300334, '604660, '176256, '605879, '603749, 
'603583, '602345, '604661 , '603787, 60331 3, '602982, '604337, '600846, '604662, '300328, '300281 , '602566, 
'602836, '604003, '603788, '603651 , '602421 , 

30 *1 07777, #177200, '100725, #219700, '100690, '100710, #160800, #603830, #183086, '600509, #220400, 

#601144, '173910, '180902, '605692, #264350, #160900, #145600, #255700, '602076, '603061, '601313, 
'154275, #604233, '604532, #108500, #121201, #170400, '300225, '121014, '139311, #125800, #160120, 
'118503, 601439, #141500, #168300, '304040, #601887, #256450, 

'186945, '154276, #300009, #216900, '600040, '601014, '601042, '602512, '601383, '605445, '602368, 
35 '603831, #117000, '601218, '108745, '605248, #177735, #173900, '601212, '182139, '601059, '600039, 

'601485, '180903, '186360, '603319, #600101, '118509, '600109, #121200, '600170, '604187, '176975, 
'137163, #310468, #263800, #262300, '603750, '600229, '124030, 

'602251, #603829, '137143, #145500, '600669, '147450, '154050, '603353, '600516, '601157, '600855, 
'601154, '602522, '249210, '600968, #252650, '171060, '600919, '156490, #259700, #601678, '601764, 
40 #310500, '131244, '300041, '121011, '125950, '114180, '602974, '600637, '113730, '118504, '605145, 

'604669, '118800, '121013, '121015, '138491 , '600421 , '104610, 

'604045, '604594, '131230, '605487, '138247, '600467, #602485, '602481, '138251, '137192, '602403, 
600851, '277900, '603785, '603152, '603199, '603475, #168600, #272120, '170280, '603852, #241200, 
'603053, '600465, #603034, '142461, '164920, '137164, '600884, '600442, '123885, '604001, '600232, 

45 *232200, '171050, '602103, '602014, '300211, '600983, '602887, 

'60441 5, '60441 8, '300242, #300071 , '604471 , '600837, 1 68350, *1 1 851 1 , 1 93007, '600300, '604654, #601 820, 
'180297, '600046, '603853, '604678, '604693, #604772, '118508, '603855, '605204, #254210, '182099, 
'182307, #130600, '601109, '114080, '300103, '182860, '605438, '601129, '603964, '600019, '516060, 
#185000, '138079, '104210, '605818, '603418, '305990, '305450 

50 4) Extracellular matrix 

21 8 entries found, searching for "extracellular matrix" 

'605912, '603479, '602201, '604633, '601418, '601548, '115437, '154870, '600754, '602261, '602285, 
'602262, '134797, '120361, '604629, '604871, '603321, '603320, '601807, #154700, '116935, '185261, 
'120360, '185250, '605470, '603767, '253700, '190180, '128239, '308700, '276901, '193300, '120324, 
55 '188826, '602109, '155760, '600514, '600261 , #177170, '600536, 

'147557, #116920, '150240, '601313, '120140, 601614, '605158, '120150, '120180, #200610, '605127, 
'193400, '192240, #173900, '152200, #136900, '135821, #130070, '120320, '120220, '112260, '310200, 
'600900, '600262, '605670, '600985, '179590, #245150, '602574, '601463, 183850, '601211, '604241, 
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*600758, *1 86745, *604710, *602369, *602090, M90182, *1 92975, 

•602178, *230740, *600065, *601652, *158106, *190181, *156790, #158810, M93210, *155120, *192977, 
•193065, #226700, *187380, *231050, M82120, *188060, *186355, 163200, *164010, #156550, *151510, 
*150370, *253800, *156225, *150325, #194050, *150290, *216550, M47620, *600215, *222600, *147559, 

5 *165380, *182888, *600491 , *1 46650, M46640, *600564, *600596, 

•600616, *600700, *600742, *138297, *182889, *154705, *600930, *301870, '153619, *601050, *601090, 
•601105, *165070, *305370, *135820, •130660, *310300, *601492, *128240, *601587, #126600, *601636, 
•6001 1 9, *601 692, *601 728, *1 25485, 601 858, *601 91 5, *602048, M 751 00, *6021 08, *1 21 01 0, *600245, *1 20470, 
•120328, *120325, *602264, *120280, *602366, *600309, 

10 *602402, *602415, *602428, *602453, *602505, #166210, *602600, *602941, *603005, *603196, 603209, 

•603221, *603234, *603319, *1 20250, *1 20210, *120120, *603489, *603551, *1 18938, *603799, *603842, 
•603924, *603963, *604042, *604063, *604149, *604160, *601028, *604467, *604510, *604592, *1 16930, 
•116806, *601284, *604724, *604806, *604807, *604808, M07269, 

•605007, *605008, *605009, *600214, *600076, *605174, *605175, *605292, *605343, *605351, #600204, 
15 ^605497, *605546, *605587, *605623, *600211 , *605702, *1 03320 

[0121] In addition to these, the various keywords shown in the above-mentioned categorization or others can be 
used for the OMIM search and the result may suggest the involvement thereof in diseases. 

[01 22] Further, the use of nucleotide sequences of cDN As of the present invention enables analyzing the expression 
20 frequency of genes corresponding to the cDNAs. In addition, functions of the genes can be predicted based on the 
information obtained by the expression frequency analysis. 

[0123] There are several methods for analyzing the expression levels of genes involved in diseases. Differences in 
gene expression levels between diseased and normal tissues are studied by the analytical methods using, for example, 
Northern hybridization, RT-PCR, DNA microarray, etc. (Experimental Medicine, Vol.17, No. 8, 980-1056 (1999); Cell 

25 Engineering (additional volume) DNA Microarray and Advanced PCR Methods, Muramatsu & Nawa (eds.), Shujunsya 
(2000)). By computer analysis, in addition to these analysis methods, the nucleotide sequences of expressed genes 
can be compared to analyze the expression frequency. For example, there is a database called "BODYMAP"; gene 
clones are extracted at random from cDNA libraries of various tissues and/or cells, and the clones homologous to one 
another are assigned to a single cluster based on the information of nucleotide sequence homology at the 3'-end; 

30 genes are classified into any clusters, and the numbers of clones in the respective clusters are compared to gain the 
information on expression frequency (http://bodymap.ims.u-tokyo.ac.jp/). 

[0124] When explicit difference in the expression levels between diseased tissues and normal tissues is observed 
for a gene by these analytical methods, it can be conclude that the gene is closely involved in a disease or disorder. 
Instead of diseased tissues, when gene expression is explicitly different between normal cells and cells reproducing 

35 disease-associated specific features, it can be concluded that the gene is closely involved in a disease or disorder. 
[0125] From the 1970 clones whose full-length nucleotide sequences had been revealed, genes involved in particular 
pathology or functions were selected by the use of databases shown below (see Example 7; "Expression frequency 
analysis in silico"). The database used in the analyses of the present invention contains nucleotide sequences of 
770,546 clones, and the population of the database is large enough for the analysis. The sequence information in the 

40 database was obtained by selecting cDNA clones at random from cDNA libraries derived from the various tissues and 
cells shown in Example 1 and determining the 5'-end sequences thereof. 

[0126] Then, the nucleotide sequences of respective clones in this database were categorized (clustered) based on 
the nucleotide sequence homology determined with a search program; the number of clones belonging to every cluster 
of each library was determined and normalized; thus, the ratio of a certain gene in a cDNA library was determined. 
45 This analysis provided the information of the expression frequency of a gene in a tissue or cell that is the source of the 
cDNA library. 

[0127] Then, in order to analyze the expression of genes corresponding to the nucleotide sequences of cDNAs of 
the present invention in tissues and cells, the libraries from the tissues or cells, which had been used in the large-scale 
cDNA analyses, were taken as subjects to compare the expression levels between different tissues or cells. Namely, 
50 the expression frequency was analyzed by comparing the previously normalized values between tissues or cells from 
which 600 or more cDNA clones whose nucleotide sequences had been analyzed were derived. The result of this 
analysis showed that the cDNA clones corresponded to the genes involved in the pathology and functions, which are 
indicated below. Each value in Tables 3 to 39 indicated below represents a relative expression frequency; the higher 
the value, the higher the expression level. 

55 

Osteoporosis-related genes 

[0128] Osteoporosis is a pathology in which bones are easily broken owing to overall decrease in components of 
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bone. The onset correlates to the balance between the functions of osteoblast producing bone and osteoclast absorbing 
bone, namely bone metabolism. Thus, the genes involved in the increase of osteoclasts differentiating from precursor 
cells of monocyte/macrophage line (Molecular Medicine 38. 642-648. (2001 )) are genes involved in osteoporosis rel- 
evant to bone metabolism. 

5 [0129] A nucleotide sequence information-based analysis was carried out to identify the genes whose expression 
frequencies are higher or lower in CD34+ cell (cell expressing a glycoprotein CD34) treated with the osteoclast differ- 
entiation factor (Molecular Medicine 38. 642-648. (2001 )) than in the untreated CD34+ cell, which is the precursor cell 
of monocyte/macrophage line. The result of comparative analysis for the frequency between the cDN A libraries pre- 
pared from the RNA of CD34+ cells (CD34C) and from the RNA of CD34+ cells treated with the osteoclast differentiation 

10 factor (D30ST, D60ST or D90ST) showed that the genes whose expression levels were different between the two were 
26 clones indicated in Table 3. These clones are involved in osteoporosis. 

Genes involved in neural cell differentiation 

15 [0130] Genes involved in neural cell differentiation are useful for treating neurological diseases. Genes with varying 
expression levels in response to induction of cellular differentiation in neural cells are thought to be involved in neuro- 
logical diseases. 

[0131] A survey was performed for genes whose expression levels are varied in response to induction of differenti- 
ation (stimulation by retinoic acid (RA) or growth inhibitor treatment after RA stimulation) in cultured cells of a neural 
20 strain, NT2. The result of comparative analysis of cDNA libraries derived from undifferentiated NT2 cells (NT2RM) and 
the cells subjected to the differentiation treatment (NT2RP, NT2RI or NT2NE) showed that the genes whose expression 
levels were different between the two were 193 clones indicated in Table 4. These genes are neurological disease- 
related genes. 

25 Cancer-related genes 

[0132] It has been assumed that, distinct from normal tissues, cancer tissues express a distinct set of genes, and 
thus the expression thereof can contribute to the carcinogenesis in tissues and cells. Thus, genes whose expression 
patterns in cancer tissues are different from those in normal tissues are cancer-related genes. Search was carried out 
30 for the genes whose expression levels in cancer tissues were different from those in normal tissues. 

[0133] The result of comparative analysis of cDNA libraries derived from breast tumor (TBAES) and normal breast 
(BEAST) showed that the genes whose expression levels were different between the two were 4 clones indicated in 
Table 5. 

[01 34] The result of comparative analysis of cDNA libraries derived cervical tumor (TCERX) and normal cervical duct 
35 (CERVX) showed that the genes whose expression levels were different between the two was one clone indicated in 
Table 6. 

[0135] The result of comparative analysis of cDNA libraries derived from colon tumor (TCOLN) and normal colon 
(COLON) showed that the genes whose expression levels were different between the two was one clone indicated in 
Table 7. 

40 [0136] The result of comparative analysis of cDNA libraries derived from esophageal tumor (TESOP) and normal 
esophagus (NESOP) showed that the genes whose expression levels were different between the two were 6 clones 
indicated in Table 8. 

[0137] The result of comparative analysis of cDNA libraries derived from kidney tumor (TKIDN) and normal kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were 132 clones indicated 
45 jn Table 9. 

[0138] The result of comparative analysis of cDNA libraries derived from liver tumor (TLIVE) and normal liver (LIVER) 
showed that the genes whose expression levels were different between the two were 25 clones indicated in Table 1 0. 
[0139] The result of comparative analysis of cDNA libraries derived from lung tumor (TLUNG) and normal lung 
(HLUNG) showed that the genes whose expression levels were different between the two were 99 clones indicated in 
50 Table 11. 

[0140] The result of comparative analysis of cDNA libraries derived from ovary tumor (TOVER) and normal ovary 
(NOVER) showed the genes whose expression levels were different between the two were 3 clones indicated in Table 
12. 

[0141] The result of comparative analysis of cDNA libraries derived from stomach tumor (TSTOM) and normal stom- 
as ach (STOMA) showed that the genes whose expression levels were different between the two were 1 5 clones indicated 
in Table 13. 

[0142] The result of comparative analysis of cDNA libraries derived from uterine tumor (TUTER) and normal uterus 
(UTERU) showed that the genes whose expression levels were different between the two were 97 clones indicated in 
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Table 14. 

[0143] The result of comparative analysis of cDN A libraries derived from tongue cancer (CTONG) and normal tongue 
(NTONG) showed that the genes whose expression levels were different between the two were 203 clones indicated 
in Table 15. 

5 [0144] These genes are involved in cancers. 

[0145] Further, there is a method to search for genes involved in development and differentiation, which is the ex- 
pression frequency analysis in which the expression levels of genes are compared between developing and/or differ- 
entiating tissues and/or cells and adult tissues and/or cells. The genes involved in tissue development and/or differ- 
entiation are genes participating in tissue construction and expression of function, and thus are useful genes, which 

10 are available for regenerative medicine aiming at convenient regeneration of injured tissues. 

[0146] By using the information of gene expression frequency gained from the database of 5'-end nucleotide se- 
quences described above, genes involved in development or differentiation of particular tissues were selected from 
the 1970 clones whose full-length nucleotide sequence had been revealed (see Example 7). 
[01 47] The result of comparative analysis of cDNA libraries derived from fetal brain (FCBBF, FEBRA or OCBBF) and 

15 adult brain (BRACE, BRALZ, BRAMY, BRAWH, BRCAN, BRCOC, BRHIP, BRSSN, BRSTN or BRTHA) showed that 
the genes whose expression levels were different between the two were 775 clones indicated in Tables 16 to 36. 
[0148] The result of comparative analysis of cDNA libraries derived from fetal heart (FEHRT) and adult heart (HEART) 
showed that the genes whose expression levels were different between the two were 28 clones indicated in Table 37. 
[0149] The result of comparative analysis of cDNA libraries derived from fetal kidney (FEKID) and adult kidney 

20 (KIDNE) showed that the genes whose expression levels were different between the two were 95 clones indicated in 
Table 38. 

[0150] The result of comparative analysis of cDNA libraries derived from fetal lung (FELNG) and adult lung (HLUNG) 
showed that the genes whose expression levels were different between the two were 99 clones indicated in Table 39. 
These genes are involved in regeneration of tissues and/or Cells. 

25 [0151] The expression frequency or the like can be analyzed by PCR based on the nucleotide sequences of cDNAs 
of the present invention. There are some known methods for comparing the quantities of amplification products obtained 
by PCR. For example, the band intensities can be determined by ethidium bromide staining. With Rl-labeled or fluo- 
rescently labeled primers, the Rl signal or fluorescence intensity can be assayed for the quantity of labeled amplification 
products. Alternatively, the quantity of amplification products can also be determined by measuring the Rl signal or the 

30 fluorescence intensity from the Rl-labeled or fluorescently labeled probe hybridizing to the products. The assay results 
thus obtained are compared and then the clones exhibiting differences in the expression levels can be selected. 
[0152] There are some quantitative PCR methods: a PCR method using internal standards; a competitive PCR, in 
which the quantification is achieved by adding, to a sample, a dilution series of a known quantity of a template RNA 
and by comparing the quantity of an amplification product derived from the RNA of interest with the quantity of an 

35 amplification product derived from the template RNA. These methods overcome the problems of errors in the amount 
of amplification products among tubes and of the plateau effect. ATAC-PCR (Adaptor-tagged competitive PCR) is a 
method of competitive PCR which is practiced by using multiple adapters of different sizes attached to a gene whose 
3' -end nucleotide sequence has previously been determined. The ratio of expression frequency of a single mRNA 
species from a number of tissues (cells) can be assayed in a single step (Nucleic Acids Research 1997, 25(22): 

40 4694-4696; "DNA Micro-array and Advanced PCR Techniques", Cell Technology, supplement, Eds., Muramatsu and 
Nawa (Shujunsha, 2000): 104-112). 

[0153] If it is observed, by using these analytical methods, that the expression levels of genes are evidently varied 
during major cellular events (such as differentiation and apoptosis), the genes are involved in the cellular events and 
accordingly are candidates for disease- and/or disorder-related genes. Further, genes exhibiting tissue-specific ex- 
45 pression are genes playing important parts in the tissue functions and, therefore, can be candidates for genes involved 
in diseases and/or disorders affecting the tissues. 

[01 54] For example, inflammation is an important biological response that is known to be involved in various diseases. 
The representative inflammation-inducing factors include TNF-a (Tumor Necrosis Factor-alpha). There exists a sign- 
aling cascade activated by TNF-a stimulations, wherein NF-kB is a transducing molecule (Cell 1995, 80:529-532). It 
50 has also been revealed that many inflammation-related genes, including IL-2, IL-6 and G-CSF, are varied in the ex- 
pression levels thereof in response to the signal through the pathway (Trends Genet. 1999, 15(6): 229-235). It is 
assumed that genes whose expression levels are varied in response to the stimulation of TNF-a also participate in 
inflammation. 

[0155] Further, the infection of Helicobacter pylori to the gastric epithelia is known to cause gastritis and gastrodu- 
55 odenal ulcer (Mebio 2000, July, 17(7): 16-33). Thus, the genes whose expression levels are altered depending on co- 
culturing cells with Helicobacter pylori may be involved in gastritis and gastroduodenal ulcer. A recent study has sug- 
gested that Helicobacter pylori strongly activates the NF-kB pathway (Gastroenterology 2000, 119: 97-1 08). 
[0156] THP-1 cell, which is a human monocyte cell line, was cultured in the presence of TNF- a (Tumor Necrosis 
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Factor-alpha). The genes whose expression levels were altered owing to the presence of TNF-a were searched for, 
and the result showed that the clones whose expression levels were increased owing to the presence of TNF-a were 
ASTRO20055530, ASTRO20055930, ASTRO20088950, BRAMY20027390, BRAMY200761 30, BRAMY2011841 0, 
BRAMY201 25360, BRAMY20237190, BRCAN20001480, BRHIP1 0000720, CD34C20001750, CTONG20078340, 

5 CTONG20085210, DFNES20063460, FCBBF20029280, FCBBF20033360, FCBBF30078600, FEBRA20007820, 
FEBRA20031280, FEBRA20031810, FEBRA20040290, HLUNG20041540, HLUNG20092530, MESAN20021860, 
MESAN20067430, MESAN20084150, NT2NE20092950, NT2RP 7003 1070, OCBBF20012520, OCBBF201 42290, 
OCBBF201 65900, OCBBF20 170350, OCBBF20 176650, PLACE60006300, PROST20011160, PROST201 06060, 
SPLEN20040780, SPLEN201 10860, SPLEN201 77400, TESTI20030610, TESTI20043130, TESTI20059370, 

10 TESTI20254480, THYMU 10004280, THYMU20030460, THYMU20062520, THYMU20078240, THYMU20150190, 
TRACH20090060, TRACH20 125620, UTERU20026620, UTERU20045200, UTERU20064120, UTERU20 103200. 
[01 57] On the other hand, the clones whose expression levels were decreased owing to the presence of TNF-a were 
BNGH420052350, BRACE20052530, BRAMY20003880, CTONG20 170940, FCBBF30022680, FCBBF30225930, 
FCBBF30257370, FEBRA20046280, KIDNE20084030, KIDNE201 88630, NT2NE20082130, OCBBF201 10210, 

15 PLACE60061370, PROST20041460, PROST20075280, PROST20110120, SMINT20006020, TESTI20046540, 
TESTI20057200, TESTI201 13940, TESTI20257910, TESTI20262940, TRACH20 149740. 
[0158] These clones are inflammation- related genes. 

[0159] MKN45, which is a gastric cancer cell line, was co-cultured with Helicobacter pylori. The genes whose ex- 
pression levels were altered owing to the presence of Helicobacter pylori were searched for, and the result showed 

20 that the clones whose expression levels were increased owing to the presence of Helicobacter pylori were 

BRAMY20028530, BRAMY20035380, OCBBF201 70350, PROST20011160, SKMUS20091 900, SPLEN20040780, 
THYMU20078240, TR AC H20 190460, UTERU20045200, UTERU20064120, ASTRO2005553 CTONG20 170940, 
FEBRA20040290, MESAN20067430, PROST2001 6760, THYMU 10004280, TRACH20090060, UTERU20041970, 
OCBBF201 42290, TESTI20030610. 

25 [01 60] On the other hand, the clones whose expression levels were decreased owing to the presence of Helicobacter 
pylori were 

ASTRO20088950, BRACE20052530, BRAMY20003880, BRAMY20027390, BRAMY20036530, BRAMY2011841 0, 
BRHIP20000210, FCBBF20032930, FCBBF30022680, FCBBF301 69870, FEBRA201 82030, KIDNE201 82540, 
LIVER20007750, MES AN 2002 1220, NT2NE20059210, NT2NE20082130, OCBBF201 55030, PROST20065100, 
30 PROST20075280, SPLEN201 10860, TESTI20057200, TESTI201 13940, TESTI201 49880, TESTI20151 800, 
TESTI201 98600, TESTI20257910, THYMU20046770, THYMU20058550, THYMU20150190, FCBBF20033360, 
FCBBF30257370, FEBRA20098040, SMINT20006020. 
[0161] These clones are involved in gastritis or gastroduodenal ulcer. 

[0162] For example, if the polypeptide encoded by thecDNA of the present invention is a regulatory factor of cellular 
35 conditions such as growth and differentiation, it can be used for developing medicines as follows. The polypeptide or 
antibody provided by the invention is injected into a certain kind of cells by microinjection. Then, using the cells, it is 
possible -to screen low molecular weight compounds, etc. by measuring the change in the cellular conditions, or the 
activation or inhibition of a particular gene. The screening can be performed as follows. 

[0163] First, the polypeptide is expressed and purified as recombinant. The purified polypeptide is microinjected into 
40 cells such as various cell lines, or primary culture cells, and the cellular change such as growth and differentiation can 
be examined. Alternatively, the induction of genes whose expression is known to be involved in a particular change of 
cellular conditions may be detected by the amount of mRNA or polypeptide. Alternatively, the amount of intracellular 
molecules (low molecular weight compounds, etc.) that is changed by the function of the gene product (polypeptide) 
which is known to be involved in a particular change of cellular conditions may be detected. The compounds to be 
45 screened (both low and high molecular compounds are acceptable) can be added to the culture media and assessed 
for their activity by measuring the change of the cellular conditions. 

[0164] Instead of microinjection, cell lines introduced with the gene obtained in the invention can be used for the 
screening. If the gene product is turn out to be involved in a particular change in the cellular conditions, the change of 
the product can be used as a measurement for screening. Once a compound is screened out which can activate or 
50 inhibit the function of the polypeptide of the invention, it can be applied for developing medicines. 

[01 65] If the polypeptide encoded by the cDNA of the present invention is a secretory protein, membrane protein, or 
protein involved in signal transduction, glycoprotein, transcription, or diseases, it can be used in functional assays for 
developing medicines. 

[0166] In case of a membrane protein, it is most likely to be a polypeptide that functions as a receptor or ligand on 
5 5 the cell surface. Therefore, it is possible to reveal a new relationship between a ligand and receptor by screening the 
membrane protein of the invention based on the binding activity with the known ligand or receptor. Screening can be 
performed according to the known methods. 

[0167] For example, a ligand against the polypeptide of the invention can be screened in the following manner. 
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N amely, a ligand that binds to a specific polypeptide can be screened by a method comprisi ng the steps of: (a) contacting 
a test sample with the polypeptide of the invention or a partial peptide thereof, or cells expressing these, and (b) 
selecting a test sample that binds to said polypeptide, said partial peptide, or said cells. 

[0168] On the other hand, for example, screening using cells expressing the polypeptide of the present invention 
5 that is a receptor protein can also be performed as follows. It is possible to screen receptors that is capable of binding 
to a specific polypeptide by using procedures (a) attaching the sample cells to the polypeptide of the invention or its 
partial peptide, and (b) selecting cells that can bind to the said polypeptide or its partial peptide. 
[0169] In a following screening as an example, first the polypeptide of the invention is expressed, and the recombinant 
polypeptide is purified. Next, the purified polypeptide is labeled, binding assay is performed using a various cell lines 
10 or primary cultured cells, and cells that are expressing a receptor are selected (Growth and differentiation factors and 
their receptors, Shin-Seikagaku Jikken Kouza Vol.7 (1991) Honjyo, Arai, Taniguchi, and Muramatsu edit, p203-236, 
Tokyo-Kagaku-Doujin). A polypeptide of the invention can be labeled with Rl such as 125 l, and enzyme (alkaline phos- 
phatase etc.). 

[0170] Alternatively, a polypeptide of the invention may be used without labeling and then detected by using a labeled 
15 antibody against the polypeptide. The cells that are selected by the above screening methods, which express a receptor 
of the polypeptide of the invention, can be used for the further screening of an agonists or antagonists of the said 
receptor. 

[0171] Once the ligand binding to the polypeptide of the invention, the receptor of the polypeptide of the invention 
or the cells expressing the receptor are obtained by screening, it is possible to screen a compound that binds to the 
20 ligand and receptor. Also it is possible to screen a compound that can inhibit both bindings (agonists or antagonists of 
the receptor, for example) by utilizing the binding activities. 

[01 72] When the polypeptide of the invention is a receptor, the screening method comprises the steps of (a) contacting 
the polypeptide of the invention or cells expressing the polypeptide of the invention with the ligand, in the presence of 
a test sample, (b) detecting the binding activity between said polypeptide or cells expressing said polypeptide and the 

25 ligand, and (c) selecting a compound that reduces said binding activity when compared to the activity in the absence 
of the test sample. Furthermore, when the polypeptide of the invention is a ligand, the screening method comprises 
the steps of (a) contacting the polypeptide of the invention with its receptor or cells expressing the receptor in the 
presence of samples, (b) detecting the binding activity between the polypeptide and its receptor or the cells expressing 
the receptor, and (c) selecting a compound that can potentially reduce the binding activity compared to the activity in 

30 the absence of the sample. 

[0173] Samples to screen include cell extracts, expressed products from a gene library, synthesized low molecular 
compound, synthesized peptide, and natural compounds, for example, but are not construed to be listed here. A com- 
pound that is isolated by the above screening using a binding activity of the polypeptide of the invention can also be 
used as a sample. 

35 [0174] A compound isolated by the screening may be a candidate to be an agonist or an antagonist of the receptor 
of the polypeptide. By utilizing an assay that monitors a change in the intracellular signaling such as phosphorylation 
which results from reduction of the binding between the polypeptide and its receptor, it is possible to identify whether 
the obtained compound is an agonist or antagonist of the receptor. Also, the compound may be a candidate of a 
molecule that can inhibit the interaction between the polypeptide and its associated proteins (including a receptor) in 

40 vivo. Such compounds can be used for developing drugs for precaution or cures of a disease in which the polypeptide 
is involved. 

[0175] Secretory proteins may regulate cellular conditions such as growth and differentiation. It is possible to find 
out a novel factor that regulates cellular conditions by adding the secretory protein of the invention to a certain kind of 
cell, and performing a screening by utilizing the cellular changes in growth or differentiation, or activation of a particular 
4 5 gene. 

[01 76] The screening can be performed, for example, as follows. First, the polypeptide of the invention is expressed 
and purified in a recombinant form. Then, the purified polypeptide is added to a various kind of cell lines or primary 
cultured cells, and the change in the cell growth and differentiation is monitored. The induction of a particular gene 
that is known to be involved in a certain cellular change is detected by the amounts of mRNA and polypeptide. After- 

50 natively, the amount of an intracellular molecule (low-molecular-weight compounds, etc.) that is changed by the function 
of a gene product (polypeptide) that is known to function in a certain cellular change is used for the detection. 
[0177] Once the screening reveals that then polypeptide of the invention can regulate cellular conditions or the func- 
tions, it is possible to apply the polypeptide as a pharmaceutical and diagnostic medicine for related diseases by itself 
or by altering a part of it into an appropriate composition. 

55 [0178] As is above described for membrane proteins, the secretory protein provided by the invention may be used 
to explore a novel ligand-receptor interaction using a screening based on the binding activity to a known ligand or 
receptor. A similar method can be used to identify an agonist or antagonist. The resulting compounds obtained by the 
methods can be a candidate of a compound that can inhibit the interaction between the polypeptide of the invention 
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and an interacting molecule (including a receptor). The compounds may be able to use as a preventive, therapeutic, 
and diagnostic medicine for the diseases, in which the polypeptide may play a certain role. 

[0179] Proteins involved in signal transduction or transcription may be a factor that affects a certain polypeptide or 
gene in response to intracellular/extracellular stimuli. It is possible to find out a novel factor.that can affect a polypeptide 
5 or gene by expressing the polypeptide provided by the invention in a certain types of cells, and performing a screening 
utilizing the activation of a certain intracellular polypeptide or gene. 

[0180] The screening may be performed as follows. First, a transformed cell line expressing the polypeptide is ob- 
tained. Then, the transformed cell line and the untransformed original cell line are compared for the changes in the 
expression of a certain gene by detecting the amount of its mRNA or polypeptide. Alternatively, the amount of an 

10 intracellular molecule (low molecular weight compounds, etc.) that is changed by the function of a certain gene product 
(polypeptide) may be used for the detection. Furthermore, the change of the expression of a certain gene can be 
detected by introducing a fusion gene that comprises a regulatory region of the gene and a marker gene (luciferase, 
P-galactosidase, etc.) into a cell, expressing the polypeptide provided by the invention into the cell, and estimating the 
activity of a marker gene product (polypeptide). 

15 [0181] If the polypeptide or gene of the invention is involved in diseases, it is possible to screen a gene or compound 
that can regulate its expression and/or activity either directly or indirectly by utilizing the polypeptide of the present 
invention. 

[0182] For example, the polypeptide of the invention is expressed and purified as a recombinant polypeptide. Then, 
the polypeptide or gene that interacts with the polypeptide of the invention is purified, and screened based on the 

20 binding. Alternatively, the screening can be performed by adding with a compound of a candidate of the inhibitor added 
in advance and monitoring the change of binding activity. In another method, a transcription regulatory region locating 
in the 5'-upstream of the gene encoding the polypeptide of the invention that is capable of regulating the expression 
of other genes is obtained, and fused with a marker gene. The fusion is introduced into a cell, and the cell is added 
with compounds to explore a regulatory factor of the expression of the said gene. 

25 [0183] The compound obtained by the screening can be used for developing pharmaceutical and diagnostic medi- 
cines for the diseases in which the polypeptide of the present invention is involved. Similarly, if the regulatory factor 
obtained in the screening is turn out to be a polypeptide, compounds that can newly affect the expression or activity 
of the polypeptide may be used as a medicine for the diseases in which the polypeptide of the invention is involved. 
[0184] If the polypeptide of the invention has an enzymatic activity, regardless as to whether it is a secretory protein, 

30 membrane protein, or proteins involved in signal transduction, glycoprotein, transcription, or diseases, a screening 
may be performed by adding a compound to the polypeptide of the invention and monitoring the change of the com- 
pound. The enzymatic activity may also be utilized to screen a compound that can inhibit the activity of the polypeptide. 
[0185] In a screening given as an example, the polypeptide of the invention is expressed and the recombinant 
polypeptide is purified. Then, compounds are contacted with the purified polypeptide, and the amount of the compound 

35 and the reaction products is examined. Alternatively, compounds that are candidates of an inhibitor are pretreated, 
then a compound (substrate) that can react with the purified polypeptide is added, and the amount of the substrate 
and the reaction products is examined. 

[0186] The compounds obtained in the screening may be used as a medicine for diseases in which the polypeptide 
of the invention is involved. Also they can be applied for tests that examine whether the polypeptide of the invention 

40 functions normally in vivo. 

[0187] Whether the secretory protein, membrane protein, signal transduction-related protein, glycoprotein-related 
protein, or transcription-related protein of the present invention is a novel protein involved in diseases or not is deter- 
mined in another method than described above, by obtaining a specific antibody against the polypeptide of the invention , 
and examining the relationship between the expression or activity of the polypeptide and a certain disease. In an 

4 5 alternative way, it may be analyzed referred to the methods in "Molecular Diagnosis of Genetic Diseases" (Elles R. 
edit, (1996) in the series of "Method in Molecular Biology" (Humana Press). 

[0188] Proteins involved in diseases are targets of screening as mentioned, and thus are very useful in developing 
drugs which regulate their expression and activity. Also, the proteins are useful in the medicinal industry as a diagnostic 
marker of the related disease or a target of gene therapy. 

50 [0189] Compounds isolated as mentioned above can be administered patients as it is, or after formulated into a 
pharmaceutical composition according to the known methods. For example, a pharmaceutical^ acceptable carrier or 
vehicle, specifically sterilized water, saline, plant oil, emuisifier, or suspending agent can be mixed with the compounds 
appropriately. The pharmaceutical compositions can be administered to patients by a method known to those skilled 
in the art, such as intraarterial, intravenous, or subcutaneous injections. The dosage may vary depending on the weight 

55 or age of a patient, or the method of administration, but those skilled in the art can choose an appropriate dosage 
properly. If the compound is encoded by polynucleotide, the polynucleotide can be cloned into a vector for gene therapy, 
and used for gene therapy. The dosage of the polynucleotide and the method of its administration may vary depending 
on the weight or age of a patient, or the symptoms, but those skilled in the art can choose properly. 
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[0190] The present invention further relates to databases comprising at least a sequence of polynucleotide and/or 
polypeptide, or a medium recorded in such databases, selected from the sequence data of the nucleotide and/or the 
amino acids indicated in Table 1 . The term "database" means a set of accumulated information as machine-searchable 
and readable information of nucleotide sequence. The databases of the present invention comprise at least one of the 

5 novel nucleotide sequences of polynucleotides provided by the present invention. The databases of the present inven- 
tion can consist of only the sequence data of the novel polynucleotides provided by the present invention or can com- 
prise other information on nucleotide sequences of known full-length cDNAs or ESTs. The databases of the present 
invention can be comprised of not only the information on the nucleotide sequences but also the information on the 
gene functions revealed by the present invention. Additional information such as names of DNA clones carrying the 

10 full-length cDNAs can be recorded or linked together with the sequence data in the databases. 

[0191] The database of the present invention is useful for gaining complete gene sequence information from partial 
sequence information of a gene of interest. The database of the present invention comprises nucleotide sequence 
information of full-length cDNAs. Consequently, by comparing the information in this database with the nucleotide 
sequence of a partial gene fragment yielded by differential display method or subtraction method, the information on 

15 the full-length nucleotide sequence of interest can be gained from the sequence of the partial fragment as a starting clue. 
[01 92] The sequence information of the full-length cDNAs constituting the database of the present invention contains 
not only the information on the complete sequences but also extra information on expression frequency of the genes 
as well as homology of the genes to known genes and known polypeptides. Thus the extra information facilitates rapid 
functional analyses of partial gene fragments. Further, the information on human genes is accumulated in the database 

20 of the present invention, and therefore, the database is useful for isolating a human homologue of a gene originating 
from other species. The human homologue can be isolated based on the nucleotide sequence of the gene from the 
original species. 

[0193] At present, information on a wide variety of gene fragments can be obtained by differential display method 
and subtraction method. In general, these gene fragments are utilized as tools for isolating the full-length sequences 

25 thereof. When the gene fragment corresponds to an already-known gene, the full-length sequence is easily obtained 
by comparing the partial sequence with the information in known databases. However, when there exists no information 
corresponding to the partial sequence of interest in the known databases, cDNA cloning should be carried out for the 
full-length cDNA. It is often difficult to obtain the full-length nucleotide sequence using the partial sequence information 
as an initial clue. If the full-length of the gene is not available, the amino acid sequence of the polypeptide encoded by 

30 the gene remains unidentified. Thus the database of the present invention can contribute to the identification of full- 
length cDNAs corresponding to gene fragments, which cannot be revealed by using databases of known genes. 
[0194] The present invention has provided 1 970 polynucleotides. As has not yet proceeded the isolation of full-length 
cDNA within the human, the invention has great significance. It is known that secretory proteins, membrane proteins, 
signal transduction-related proteins, glycoprotein-related proteins, transcription -related proteins, and so on are involved 

35 in many diseases. The genes and proteins involved in diseases are useful for developing a diagnostic marker or med- 
icines for regulation of their expression and activity, or as a target of gene therapy. 

[0195] In particular, cDNA assumed to encode secretory proteins, which were provided by this invention, are very 
important for the industry since the encoded proteins themselves are expected to be useful as pharmaceutical agents 
and many disease-related genes may be included in them. In addition, membrane proteins, signal transduction-related 
40 proteins, transcription -related proteins, disease-related proteins, and genes encoding them can be used as indicators 
for diseases, etc. These cDNA are also very important for the industry, which are expected to regulate the activity or 
expression of the encoded protein to treat diseases, etc. 

[0196] Any patents, patent applications, and publications cited herein are incorporated by reference. 
[0197] The invention is illustrated more specifically with reference to the following examples, but is not to be construed 
45 as being limited thereto. 

EXAMPLE 1 

Preparation of cDNA library by oligo-capping 

50 ~" " 

(1) Extraction and purchase of mRNA 

[01 98] Total RNAs as mRNA sources were extracted from human tissues (shown below) by the method as described 
in the reference (J. Sambrook, E. F. Fritsch & T. Maniatis, Molecular Cloning Second edition, Cold Spring harbor Lab- 
55 oratory Press, 1989). Further, by the method as described in the reference (J. Sambrook, E. F. Fritsch & T. Maniatis, 
Molecular Cloning Second edition, Cold Spring harbor Laboratory Press, 1989), total RNAs as mRNA sources were 
extracted from human culture cells and human primary culture cells (shown below) which had been cultivated by the 
methods described in the catalogs. 
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[0199] The library names and the origins are indicated below in the order of "Library name: Origin". When a library 
was prepared by the subtraction method, the item is followed by a description of how to prepare the subtracted library. 

Extraction of mRNA from human tissues> 

5 

[0200] 

NTONG: Normal tongue; 
CTONG: Tongue cancer; 
10 FCBBF: Fetal brain; 

OCBBF: Fetal brain; 
PLACE: Placenta; 

SYNOV: Synovial membrane tissue (from rheumatioid arthritis). 
15> Extraction of mRNA from culture cells> 
[0201] 

BNGH4: H4 cells (ATCC #HTB-148); 
20 IMR32: IMR32 cells (ATCC #CCL-127); 

SKNMC: SK-N-MC cells (ATCC #HTB-10); 

3NB69: NB69 cells (RCB #RCB0480); 

BGGI1 : GI1 cells (RCB #RCB0763); 

NB9N4: NB9 cells (RCB #RCB0477); 
25 SKNSH: SK-N-SH cells (RCB #RCB0426); 

NT2RM: NT2 cells (STARATAGENE #204101); 

NT2RP: NT2 cells treated with retinoic acid (RA) for 5 weeks to induce the differentiation; 

NT2RI: NT2 cells treated with RAfor5 weeks to induce the differentiation, followed by the treatment with the growth 

inhibitor for 2 weeks; 

30 NT2NE: NT2 cells were treated with RA and the growth inhibitor for the neuronal differentiation, and the resultant 

neurons were concentrated and harvested (NT2 Neuron); 

NTISM: NT2 cells (STARATAGENE #2041 01 ) were treated with RA for 5 weeks to induce the differentiation, and 
then treated with the growth inhibitor for 2 weeks; mRNA was prepared from the cells and a cDNA library was 
constructed from the mRNA; the cDNAs of the library whose nucleotide sequences were shared by those of mRNAs 
35 from undifferentiated NT2 cells were subtracted by using a Subtract Kit (Invitrogen #K4320-01) ; the subtracted 

library (NT2RI-NT2RM) was provided by this procedure. 

[0202] RCB indicates that the cell was provided by the Cell Bank, RIKEN GENE BANK, The Institute of Physical and 
Chemical Research; ATCC indicates that the cell was provided by American Type Culture Collection. 

40 

Extraction of mRNA from primary culture cells> 
[0203] 

4 5 ASTRO: Normal human astrocyte NHA5732, Takara Shuzo #CC2565; 

DFNES: Normal human dermal fibroblast (neonatal skin); NHDF-Neo NHDF2564, Takara Shuzo #CC2509; 

MESAN: Normal human mesangial cell NHMC56046-2, Takara Shuzo #CC2559; 

NHNPC: Normal human neural progenitor cell NHNP5958, Takara Shuzo #CC2599; 

PEBLM: Normal human peripheral blood mononuclear cell HPBMC5939, Takara Shuzo #CC2702; 
50 HSYRA: Human synoviocyte HS-RA (from rheumatioid arthritis), Toyobo #T404K-05; 

PUAEN: Normal human pulmonary artery endothelial cells, Toyobo #T302K-05; 

UMVEN: Normal human umbilical vein endothelial cell HUVEC, Toyobo #T200K-05; 

HCASM: Normal human coronary artery smooth muscle cell HCASMC, Toyobo #T305K-05; 

HCHON: Normal human chondrocyte HC, Toyobo #T402K-05; 
55 HHDPC: Normal human dermal papilla cell HDPC, Toyobo #THPCK-001; CD34C: CD34+ cells (AllCells, LLC 

#CB14435M) ; 

D30ST: CD34+ cells treated with the osteoclast differentiation factor (ODF) for 3 days to induce the differentiation; 
D60ST: CD34+ cells treated with ODF for 6 days to induce the differentiation; 
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D90ST: CD34+ cells treated with ODF for 9 days to induce the differentiation. 

[0204] Then, total RNAs extracted from the following human tissues were purchased and used as mRNA sources. 
The library names and the origins are indicated below in the order of "Library name: Origin". When a library was 
5 prepared by the subtraction method, the item is followed by a description of how to prepare the subtracted library. 

<Purchase of total RNA containing mRNA extracted from human tissues> 

[0205] 

10 

ADRGL: Adrenal gland, CLONTECH #64016-1; 

BRACE: Brain (cerebellum), CLONTECH #64035-1 ; 

BRAWH: Whole brain, CLONTECH #64020-1; 

FEBRA: Fetal brain, CLONTECH #64019-1; 
15 FELIV: Fetal liver, CLONTECH #6401 8-1 ; 

HEART: Heart, CLONTECH #64025-1; 

HLUNG: Lung, CLONTECH #64023-1; 

KIDNE: Kidney, CLONTECH #64030-1; 

LIVER: Liver, CLONTECH #64022-1; 
20 MAMGL: Mammary Gland, CLONTECH #64037-1 ; 

PANCR: Pancreas, CLONTECH #64031-1; 

PROST: Prostate, CLONTECH #64038-1 ; 

SALGL: Salivary Gland, CLONTECH #64026-1 ; 

SKMUS: Skeletal Muscle, CLONTECH #64033-1; 
25 SMINT: Small Intestine, CLONTECH #64039-1 ; 

SPLEN: Spleen, CLONTECH #64034-1; 

STOMA: Stomach, CLONTECH #64090-1; 

TBAES: Breast (Tumor), CLONTECH #64015-1 ; 

TCERX: Cervix (Tumor), CLONTECH #64010-1; 
30 TCOLN: Colon (Tumor), CLONTECH #64014-1 ; 

TESTI: Testis, CLONTECH #64027-1 ; 

THYMU: Thymus, CLONTECH #64028-1; 

TLUNG: Lung (Tumor), CLONTECH #64013-1 ; 

TOVAR: Ovary (Tumor), CLONTECH #64011-1 ; 
35 TRACH: Trachea, CLONTECH #64091 -1 ; 

TUTER: Uterus (Tumor), CLONTECH #64008-1; 

UTERU: Uterus, CLONTECH #64029-1; 

ADIPS: Adipose, Invitrogen #D6005-01; 

BLADE: Bladder, Invitrogen #D6020-01; 

BRALZ: Cerebral cortex from an Alzheimer patient (Brain, cortex, Alzheimer), Invitrogen #D6830-01 ; 

CERVX: Cervix, Invitrogen #06047-01 ; 

COLON: Colon, Invitrogen #D6050-0; 

NESOP: Esophagus, Invitrogen #D6060-01; 

PERIC: Pericardium, Invitrogen #D61 05-01; 
45 RECTM: Rectum, Invitrogen #D6110-01 ; 

TESOP: Esophageal (Tumor), Invitrogen #D6860-01; 

TKIDN: Kidney (Tumor), Invitrogen #06870-01 ; 

TLIVE: Liver (Tumor), Invitrogen #D6880-01; 

TSTOM: Stomach (Tumor), Invitrogen #D6920-01; 
50 BEAST: Adult breast, STARATAGENE #735044; 

FEHRT: Fetal heart, STARATAGENE #738012; 

FEKID: Fetal kidney, STARATAGENE #738014; 

FELNG: Fetal lung, STARATAGENE #738020; 

NOVAR: Adult ovary, STARATAGENE #735260; 
55 BRASW: subtracted library (BRALZ-BRAWH). A cDNA library was constructed from mRNA prepared from tissues 

of cerebral cortex obtained from an Alzheimer patient [BRALZ: Cerebral cortex from an Alzheimer patient (Brain, 

cortex, Alzheimer), Invitrogen #D6830-01]; the cDNAs of this library whose nucleotide sequences were shared by 

those of mRNAs from whole brain tissue [BRAWN: Whole brain, CLONTECH #64020-1 ] were subtracted by using 
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a Subtract Kit (Invitrogen #K4320-01). 

[0206] Further, mRNAs extracted and purified as poly A(+) RNAs from the human tissues shown below were pur- 
chased. A cDNA library was prepared from an RNA mixture in which the poly A(+) RNA from each tissue had been 
5 combined with poly A(-) RNA. The poly A(-) RNA was prepared by removing poly A(+) RNA from the total RNA of whole 
brain tissue (CLONTECH #64020-1) by using oligo dT cellulose. The library names and the origins are indicated below 
in the order of "Library name: Origin". 

<Purchase of mRNAs of human tissues as poly A(+) RNAs> 

10 

[0207] 

BRAMY: Brain (amygdala), CLONTECH #6574-1 ; 
BRCAN: Brain (caudate nucleus), CLONTECH #6575-1; 
15 BRCOC: Brain (corpus callosum), CLONTECH #6577-1 ; 

BRHIP: Brain (hippocampus), CLONTECH #6578-1; 
BRSSN: Brain (substantia nigra), CLONTECH #6580-1; 
BRSTN: Brain (subthalamic nucleus), CLONTECH #6581-1; 
BRTHA: Brain (thalamus), CLONTECH #6582-1. 

20 

(2) Preparation of cDNA library 

[0208] cDNA library was prepared from each RNA by the improved method (WO 01/04286) of oligo capping [M. 
Maruyama and S. Sugano, Gene, 138: 171-174 (1994)]. A series of procedures, BAP (Bacterial Alkaline Phosphatase) 

25 treatment, TAP (Tobacco Acid Pyrophosphatase) treatment, RNA ligation, first strand cDNA synthesis and RNA re- 
moval, were carried out using the oligo-cap linker (SEQ ID NO: 4093) and oligo dT primer (SEQ ID NO: 4094), as 
described in WO 01/04286. Then, the single-stranded cDNA was converted to a double-stranded cDNA by PCR 
(polymerase chain reaction) using 5' (SEQ ID NO: 4095) and 3' (SEQ ID NO: 4096) PCR primers, and then digested 
with Sfi\. Then, a fraction of cDNA fragments, typically 2-kb or longer (3-kb or longer in some cases), was unidirectionally 

30 cloned into a Dra\ 1 1 -digested pME1 8SFL3 vector (Figure 1 ) (GenBank AB009864, Expression vector); the cDNA library 
was thus prepared. 

[0209] The names of cDNA libraries, which were used in the analysis of full-length cDNA sequences, and their origins 
are shown in Table 2. 

35 Table 2 



Library 


Type 


Origin etc. 


ADRGL 


Tissue 


Adrenal gland (CLONTECH #64016-1) 


ASTRO 


Primary culture cell 


Normal Human Astrocyte NHA5732 (Takara Shuzo #CC2565) 


BGGI1 


Culture cell 


GI1 cells (RCB#RCB0763) 


BNGH4 


Culture cell 


H4 cells (ATCC #HTB-148) 


BRACE 


Tissue 


Brain, cerebellum (CLONTECH #64035-1) 


BARMY 


Tissue 


Brain, amygdala (CLONTECH #6574-1) 


BRAWH 


Tissue 


Brain, whole (CLONTECH #64020-1) 


BRCAN 


Tissue 


Brain, caudate nucleus (CLONTECH #6575-1) 


BRCOC 


Tissue 


Brain, corpus callosum (CLONTECH #6577-1) 


BRHIP 


Tissue 


Brain, hippocampus (CLONTECH #6578-1) 


BRSSN 


Tissue 


Brain, substantia nigra (CLONTECH #6580-1) 


CD34C 


Primary culture cell 


CD34+ cells (AllCells, LLC #CB14435M) 


CTONG 


Tissue 


Tongue, Cancer 


DFNES 


Primary culture cell 


Normal Human Dermal Fibroblasts (Neonatal Skin); NHDF-Neo 
NHDF2564 (Takara Shuzo #CC2509) 
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Table 2 (continued) 





Liurary 


Type 


Origin etc. 




rODDr 


i issue 


orain, retai 


5 


rrpQA 
rtonM 


Tissue 


brain, retai (oluin i tun #b4Ui9-i) 




nonUiN 


Primary culture cell Tissue 


human cnonarocytes no ( loyobo # 1 402K-05) 




nbAn I 




Heart (CLUN I ton ffb4U<£b- 1 ) 


10 


nnUrO 


Primary culture cell 


Unman >J nim n| rtorxiHo r*r\ 1 to 1 innr> /TV» J1TU n/^lX ft A 4 \ 

numan aermai papiua ceils MUru ( loyooo # I nrOK-ooi ) 




ill 1 iKtr* 
MLUNb 


Tissue 


1 linn / /""» 1 r"\ MTCPU UC: A AOO 4 \ 

Lung (CLUN 1 ton ffb402o-1) 




KIDNE 


Tissue 


Kidney (CLONTECH #64030-1) 


15 


LIVER 


Tissue 


Liver (CLONTECH #64022-1) 


MESAN 


Primary culture cell 


Normal human mesangial cells NHMC56046-2 (Takara Shuzo#CC2559) 




NESOP 


Tissue 


Esophagus (Invitrogen #D6060-01) 




NT2NE 


Culture cell 


NT2 cells concentrated after differentiation (NT2 Neuron) 


20 


NT2RI 


Culture ceil 


NT2 cells treated by growth inhibitor for 2 weeks after RA induction for 5 
weeks 




NT2RP 


Culture cell 


NT2 cells treated by RA for 5 weeks 




NTONG 


Tissue 


Tongue 


25 


OCBBF 


Tissue 


Brain, Fetal 




PANCR 


Tissue 


Pancreas (CLONTECH #64031-1) 


30 


PEBLM 


Primary culture cell 


Human peripheral blood mononuclear cells HPBMC5939 (Takara Shuzo 
#CC2702) 


PLACE 


Tissue 


Placenta 




PROST 


Tissue 


Prostate (CLONTECH #&4038-1 ) 




PUAEN 


Primary culture cell 


Human pulmonary artery endothelial cells (Toyobo #T302K-05) 


35 


oALbL 


Tissue 


Salivary Gland (CLONTECH #64026-1) 




SKMUS 


Tissue 


Skeletal Muscle (CLONTECH #64033-1) 




oKNMO 


Culture cell 


SK-N-MC cells (ATCC #HTB-10) 


40 




Culture cell 


SK-N-SH cells (RCB #RCB0426) 


O IV A 1 K IT 

SMINT 


Tissue 


Small Intestine (CLONTECH #64039-1) 




SPLEN 


Tissue 


Spleen (CLONTECH #64034-1) 




TCCHD 

1 toUr 


Tssue 


Esophageal, Tumor (Invitrogen #D6860-01) 




1 CO 1 1 




Tactic iC\ OMTCPH &fsAC\07 i\ 
I6SLIS ^OLVJIN i ton ffD4Utrl ) 




THYMU 


Tissue 


Thymus (CLONTECH #64028-1) 




TKIDN 


Tissue 


Kidney, Tumor (Invitrogen #D6870-01) 


50 


TRACH 


Tissue 


Trachea (CLONTECH #64091 -1 ) 


UMVEN 


primary culture cell 


Human umbilical vein endothelial cells HUVEC (Toyobo #T200K-05) 




UTERU 


Tissue 


Uterus (CLONTECH #64029-1) 



[0210] The cDNA library with the high fullness ratio (the fullness ratio of 5'-end, which was calculated for each cDNA 
library by using the protein coding region found in known mRNA species as an index, was 90% in average) prepared 
by the improved oligo-capping method was constructed by using a eukaryotic expression vector pME18SFL3. The 
vector contains SRa promoter and SV40 small t intron in the upstream of the cloning site, and SV40 polyA added signal 
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sequence site in the downstream. As the cloning site of pME18SFL3 has asymmetrical Dralll sites, and the ends of 
cDNA fragments contain Sfil sites complementary to the Dralll sites, the cloned cDNA fragments can be inserted into 
the downstream of the SRa promoter unidirectionally. Therefore, clones containing full-length cDNA can be expressed 
transiently by introducing the obtained plasmid directly into COS cells, etc. Thus, the clones can be analyzed very 
5 easily in terms of the proteins that are the gene products of the clones, or in terms of the biological activities of the 
proteins. 

(3) Assessment of the 5'-end completeness of clones derived from the cDNA library prepared by oligo-capping 

10 [0211] With respect to the plasmid DNAs of clones derived from the libraries, the nucleotide sequences of cDNA 5'- 
ends (3'-ends as well in some cases) were determined in a DNA sequencer (ABI PRISM 3700, PE Biosystems), after 
sequencing reaction was conducted by using a DNA sequencing reagent (BigDye Terminator Cycle Sequencing FS 
Ready Reaction Kit, PE Biosystems) according to the manual. A database was constructed based on the obtained data. 
[0212] The 5'-end completeness of about 770,000 clones derived from the human cDNA libraries prepared by the 

15 improved oligo-capping method was determined by the following method. The clones whose 5'-end sequences were 
consistent with those of known human mRNA in the public database were judged to be "full-length" if they had a longer 
5'-end sequence than that of the known human mRNA; or even though the 5'-end sequence was shorter, if it contained 
the translation initiation codon it was judged to have the 'full-length" sequence. Clones which did not contain the trans- 
lation initiation codon were judged to be "not-full-length". The fullness ratio ((the number of full-length clones)/(the 

20 number of full-length and not-full-length clones)) at the 5'-end of the cDNA clones was determined by comparing with 
known human mRNA. As a result, the fullness ratio of the 5*-ends was 90%. The result indicates that the fullness ratio 
at the 5'-end sequence was extremely high in the human cDNA clones obtained by the oligo-capping method. 

EXAMPLE 2 

25 

Sequencing analysis of cDNA ends and selection of full-length clones 

[0213] With respect to the plasmid DNAs of clones obtained from each cDNA library, the 5'-end nucleotide sequences 
of the cDNAs were determined in a DNA sequencer (ABI PRISM 3700, PE Biosystems), after sequencing reaction 
30 was conducted by using a DNA sequencing reagent (Dye Terminator Cycle Sequencing FS Ready Reaction Kit, dRhod- 
amine Terminator Cycle Sequencing FS Ready Reaction Kit or BigDye Terminator Cycle Sequencing FS Ready Re- 
action Kit, PE Biosystems) according to the manual. A database was constructed using the data obtained. 
[0214] For the analyzed 5'-end sequences of cDNA clones, the data with the annotation of "complete cds" in the 
GenBank and UniGene were searched by BLAST homology search. When identical to certain human mRNA sequenc- 
es es, such cDN A clones were excluded. Then, clustering was carried out. When the identity was 90% or higher, and the 
length of consensus sequence was 50 base pairs or longer, the cDNA clones were assumed to belong to an identical 
cluster, and thus clustered. cDNA clones longer in the 5' direction were selected from the members belonging to a 
cluster; if required, the 3'-end sequences of the selected clones were determined by the same analysis method as 
used to determine the 5*-end sequences. The data of the end sequences obtained were analyzed, and then the clones 
40 forming a sequence contig at 5'- and 3'-ends were excluded. Further, as mentioned above, the data was analyzed 
again by BLAST homology search; when identical to certain human mRNA sequences (including sequences patented 
and applied for), the cDN A clones were excluded. Thus, the cDN As clones to be analyzed for their nucleotide sequence 
were obtained. 

45 EXAMPLE 3 

Analysis of the full-length nucleotide sequences 

[0215] The full-length nucleotide sequences of the selected clones were determined. The nucleotide sequence de- 
50 termination was mainly performed by primer walking method comprising the dideoxy terminator method using custom- 
made synthetic DNA primers. Namely, the nucleotide sequences of the DNAs were determined in a sequencer from 
PE Biosystems, after sequencing reaction was carried out with a DNA sequencing reagent from the same supplier 
using the custom-made synthetic DNA primers according to the manual. A part of the clones were analyzed with a 
DNA sequencer from Licor. 

55 [0216] Further, the nucleotide sequences of a part of the clones were determined by the shotgun method where the 
plasmlds containing the cDNAs were digested at random were used, instead of the use of custom-made primers, by 
the same method in the DNA sequencer. The full-length nucleotide sequences were finally determined by completely 
assembling the partial nucleotide sequences obtained by the above method. 
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[0217] Then, the regions translatable to proteins were deduced from the determined full-length nucleotide sequences, 
and thereby the amino acid sequences were determined. SEQ ID NOs corresponding to the respective sequences are 
shown in Table 1 . 

5 EXAMPLE 4 

Functional prediction by homology search 

[0218] For the determined nucleotide sequences, GenBank, SwissProt, UniGene, and nr were searched by BLAST. 

10 The clones exhibiting higher homology, which were convenient to predict their functions based on the nucleotide se- 
quences and deduced amino acid sequences, were selected based on the BLAST search hit data whose P value or 
E value was 10* 4 or lower and for which the length of consensus sequence x homology = 30 or higher in the amino 
acid database search. Further, from them, representative clones were selected, which are shown as Homology Search 
Result Data in the last part herein. Accordingly, the data shown herein are merely the representative data, and the 

15 molecule exhibiting homology to each clone is not limited thereto. Further, with respect to a part of clones, the BLAST 
search hit data that did not meet the criteria as described above are not shown herein. 

EXAMPLE 5 

zo Search for signal sequence, transmembrane domain and other functional domains in the deduced amino acid 
sequences 

[0219] With respect to the amino acid sequences deduced from the full-length nucleotide sequences, the prediction 
was made for the presence of signal sequence at the amino terminus, the presence of transmembrane domain, and 

25 the presence of functional protein domains (motifs). The signal sequence at the amino terminus was searched for by 
PSORT [K. Nakai & M. Kanehisa, Genomics, 14: 897-911 (1992)]; the transmembrane domain, by SOSUI [T. Hirokawa 
et a!., Bioinformatics, 14: 378-379 (1998)] (Mitsui Knowledge Industry); the function domain, by Ram (http://www. 
sanger.ac.uk/Software/Pfam/index.shtml). The amino acid sequence in which the signal sequence at the amino termi- 
nus or transmembrane domain had been predicted to be present by PSORT or SOSUI were assumed to be a secretory 

30 or membrane protein. Further, when the amino acid sequence hrt a certain functional domain by the Pfam functional 
domain search, the protein function can be predicted based on the hit data, for example, by referring to the function 
categories on the PROSITE (httpy/www.expasy.ch/cgi-bin/prosite-list.pl). In addition, the functional domain search can 
also be carried out on the PROSITE. 

[0220] The search results obtained with the respective programs are shown below. 
35 [0221] The clones whose deduced amino acid sequences were detected to have the signal sequences by PSORT 
are as follows. 

ADRGL20021910, ADRGL20036380, ADRGL20063770, ASTR020020240, BNGH420052350, BNGH420077980, 
BRACE20054080, BRACE20 194670, BRAMY20044920, BRAMY20047560, BRAMY20 137360; BRAMY20204270, 
BRAMY20237190, BRAMY20251 750, BRAWH20020470, BRAWH20093070, BRCAN1 0001 680, BRHIP1 0000720, 

40 BRSSN20091190, CD34C20001 750, CTONG20059130, CTONG20069320, FCBBF30062490, FCBBF30 132660, 
FEBRA20039260, FEBRA20040230, FEBRA20040560, FEBRA20046280, FEBRA201 82030, HCHON1 0001660, 
HCHON20015050, HEART1 0001 490, HHDPC20088160, HLUNG20032460, HLUNG20034970, HLUNG20050760, 
HLUNG20081390, HLUNG20088750, KIDNE20134130, KIDNE20143200, LIVER20007750, LIVER20010510, 
MESAN20021220, MESAN20027900, MESAN20 095220, NT2NE20069580, NT2NE20082130, NT2NE20 167660, 

45 NT2RP70003110, OCBBF20000740, OCBBF20012520, OCBBF201 10730, OCBBF201 18720, OCBBF201 55030, 
OCBBF201 70350, OCBBF20191950, PANCR1 0000860, PEBLM20001800, PLACE60004260, PLACE60006300, 
PLACE60055590, PLACE60056910, PLACE60057860, PLACE601 04630, P LAC E60 184870, PROST20050390, 
PROST20084680, PROST201 05450, PROST201 06060, PROST20110120, SKMUS20091900, SKNMC20006350, 
SMINT20024140, SMINT20028840, SMINT20086250, SMINT20088440, SMINT20088690, SPLEN2001 7810, 

50 SPLEN20073880, SPLEN20080070, SPLEN20101950, SPLEN201 08000, SPLEN201 10860, SPLEN201 18050, 
SPLEN201 38600, SPLEN20139100, SPLEN201 93230, SPLEN201 93490, SPLEN20201830, TESTI20043130, 
TESTI20047370, TESTI20057200, TESTI20059080, TESTI20061 200, TESTI20063330, TESTI20063600, 
TESTI201 02390, TESTI201 1 61 20, TESTI201 51 800, TESTI201 66670, TESTI2021 0030, TESTI20245860, 
THYMU20020800, THYMU20046770, THYMU20050010, THYMU20054800, THYMU20055740, THYMU20083390, 

55 THYMU201 15380, TRACH20081270, TRACH201 59390, UTERU20040150, UTERU20064120, UTERU20086530, 
UTERU201 27150 

[0222] The clones whose deduced amino acid sequences were detected to have the transmembrane domains by 
SOSUI are as follows. Numerals indicate the numbers of transmembrane domains detected in the deduced amino acid 
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sequences. Of the search result, the clone name and the number of transmembrane domains are demarcated by a 
double slash mark (//). 

ADRGL20020290//1 0, ADRGL20021 91 0//2, ADRGL20063770//2, ASTRO2001 001 0//4, ASTRO20045840//3, 
ASTRO20053430//1 , ASTRO20055530//1 , ASTRO20055570//2, ASTRO20055930//2, ASTRO200751 50//1 , 

5 ASTRO20088950//1 , ASTRO20091 1 80//4, BNGH420021 680//1 , BNGH420023870//1 , BNGH420052350//1 , 
BNGH420059680//1 , BNGH420075940//1 , BNGH420087430//3, BR ACE 1 000051 0//2, BRACE20052530//1 , 
BRACE20066360//3, BRACE2006871 0//1 , BRACE20069000//5, BRACE2006911 0//1 , BRAMY1 0001 730//1 , 
BRAMY20001510//1 , BRAMY20003880//1 , BRAMY20024790//2, BRAMY20027390//2, BRAMY20028530//2, 
BRAMY20035380//2, BRAMY20045210//1 , BRAMY20050940//3, BRAMY20053910//2, BRAMY20055760//5, 

10 BRAMY20072440//8, BRAMY20083820//2, BRAMY20089770//1 , BRAMY20096930//1 , BRAMY20118410//1 , 

BRAM Y201 23400//2, BRAMY201 25550//1 , BRAMY201 2731 0//1 , BRAMY201 27760//1 , BRAMY201 35720//3, 
BRAM Y201 39440//5, BRAM Y201 5251 0//1 , BRAMY201 94680//2, BRAMY20204270//1 , B RAM Y20225320//2 , 
BRAMY202371 90//1 , BRAMY20251 750//1 , BRAMY20285650//2, BRAWH20020470//7, BRAWH2002601 0//1 , 
BRAWH20030000//2, BRAWH20039640//2, BRAWH20055330//1 , BRAWH20078 6207/2, BRAWH20093040//3, 

15 BRAWH201 85270//5, BRAWH201 90550//1 , BRCAN1 0000760//1 0, BRCAN20001 480//3, BRCAN200041 80//1 , 
BRCOC20000470//2, BRCOC20003600//1 , BRHIP10001040//5, BRHIP20000210//1 , BRSSN20001970//4, 
BRSSN20074640//2, CD34C20001 750//1 , CTONG2001 7490//1 , CTONG20029030//4, CTONG20041 260//9, 
CTONG20044870//2, CTONG20045500//2, CTONG20049480//3, CTONG20051450//1 , CTONG20056150//2, 
CTONG20059130//6, CTONG20060040//1 , CTONG20065680//1 , CTONG20071680//13, CTONG20076810//1 , 

20 CTONG20078340//2, CTONG20079590//1 , CTONG20083980//1 , CTONG20084020//2, CTONG201 67750//1 , 
CTONG201 68240//3, CTONG201 79890//5, CTONG201 83830//1 , DFNES2001 8000//2, DFNES200281 70//1 , 
DFNES20029660//8, DFNES20072990//9, DFNES20080880//1 , FCBBF20018680//3, FCBBF20029280//2, 
FCBBF20032930//1 , FCBBF20036360//2, FCBBF20054390//2, FCBBF30022680//1 , FCBBF3004261 0//7, 
FCBBF30062490//2, FCBBF30075970//1 , FCBBF30078600//3, FCBBF3009541 0//1 , FCBBF301 05440//3, 

25 FCBBF301 1 8670//1 , FCBBF301 45670//1 , FCBBF301 6451 0//1 , FCBBF301 69870//1 , FCBBF301 72330//1 , 
FCBBF30177290//2, FCBBF30179740//1 , FCBBF30195690//2, FCBBF30197840//1, FCBBF30212210//1, 
FCBBF302231 1 0//1 , FCBBF30225930//9, FCBBF3023061 0//1 , FCBBF30260480//5, FCBBF3026651 0//3, 
FCBBF30287940//1 , FCBBF50000610//2, FCBBF50004950//3, FEBRA20007820//4, FEBRA20018670//2, 
FEBRA20031280//7, FEBRA20031810//2, FEBRA20040560//4, FEBRA20057010//1 , FEBRA20080860//1 , 

30 FEBRA20084750//1 , FEBRA2011 5930//10, FEBRA20116650//4, FEBRA20121950//5, FEBRA20141980//1 , 

FEBRA201 77800//1 , FEBRA201 82030//1 , HCHON20015050//1 , HEART20031 680//1 , HHDPC1 00011 40//1 , 
HHDPC20051850//1 , HHDPC20082790//3, HHDPC200881 60//1 , HLUNG20015070//2, HLUNG20020850//2, 
HLUNG20029490//1 , HLUNG20032460//1 , HLUNG20033350//2, HLUNG20037160//2, HLUNG20050760//1 , 
HLUNG20052300//1, HLUNG20060670//1, HLUNG20065990//1, HLUNG20074330//1 , HLUNG20081390//2, 

35 HLUNG20092530//1 , KIDNE2001 6360//5, KIDNE20083150//3, KIDNE20084030//12, KIDNE20084800//2, 
KIDNE20086490//1 , KIDNE20086660//2, KIDNE20094670//1 , KIDNE20142900//1 , KIDNE20148080//1 , 
KIDNE20160960//2, KIDNE20163710//1 , KIDNE201 69180//2, KIDNE20182540//3, KIDNE20186170//1 , 
KIDNE20188630//1, KIDNE20189960//1, LIVER20007750//8, LIVER20010990//4, LIVER20030650//6, 
LIVER20038000//4, MESAN20007110//1 , MESAN20008150//3, MESAN20021220//2, MESAN20060430//1 , 

40 MESAN20067430//3, M ES AN 200841 50//3, NT2NE20018740//1 , NT2NE20039210//1 , NT2NE20053230//1 , 
NT2NE20059210//1 , NT2NE20080770//1 , NT2NE20082130//3, NT2NE20092950//1 , NT2NE20152620//1 , 
NT2NE20167660//1, NT2NE20181800//1, NT2RI20016240//2, NT2RI20021200//3, NT2RI20033920//5, 
NT2RP70027790//2, NT2RP70031 070//1 0, NT2RP70031 480//6, NT2RP700871 40//1 , OCBBF20000740//2, 
OCBBF2001 2520//2, OCBBF201 09780//1 , OCBBF201 1 021 0//2, OCBBF201 1 0730//2, OCBBF201 1 2280//3, 

45 OCBBF20118720//3, OCBBF20123200//2, OCBBF20165910//9, OCBBF20176650//1 , OCBBF201 85630//1 , 
OCBBF20191950//3, PANCRI0000860//1 , PLACE60004260//1 , PLACE60006300//2, PLACE60053280//3, 
PLACE6005691 0//1 , PLACE60057860//3, PLACE60061 370//1 , PLACE60070500//2, PLACE601 04630//3, 
PLACE601 0701 0//2, PLACE601 84870//1 , PROST2001 1 1 60//2, PROST200141 50//2, PROST20035B30//1 , 
PROST20050390//1 , PROST200651 00//7, PROST20073280//1 , PROST20082430//1 , PROST20084680//3, 

50 PROST20105450//6, PROST20110120//1 , PROST20114100//2, PROST20152510//4, PROST201 68600//6, 

PUAEN 1 0000870//2, SKMUS20006790//1 , SKMUS20020770//1 , SKNMC20006350//1 , SKNSH20094350//2, 
SMINT20008110//3, SMINT20024140//1 , SMINT20026200//1 , SMINT20028840//1 , SMINT20045470//1 , 
SMINT20081 330//2, SM INT200921 60//1 , SPLEN2001 51 00//2, SPLEN2001 761 0//1 , SPLEN2001 781 0//2, 
SPLEN20024620//2, SPLEN20054500//1 , SPLEN20058180//1 , SPLEN20085910//2, SPLEN20090880//2, 

55 SPLEN201 01 950//5, SPLEN201 04690//1 , SPLEN201 051 00//2, SPLEN201 1 01 80//1 , SPLEN201 21 790//1 , 
SPLEN201 25230//1 , SPLEN201 36700//1 , SPLEN201 75920//1 , SPLEN201 77400//2, SPLEN201 82850//1 , 
SPLEN201 83950//1 , SPLEN201 90080//1 , SPLEN201 90770//1 , SPLEN201 93230//1 , SPLEN201 93790//1 , 
SPLEN20204670//1 , TESOP1 0000350//1 , TESTI200061 60//5, TESTI20038240//1 , TESTI20043220//1 , 
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TESTI20046540//1 , TESTI20046870//1 , TESTI20050400//1 , TESTI20051 730//7, TESTI20053260//2, 
TESTI20053780//1 , TESTI20057200//1 , TESTI20057590//1 , TESTI20061 200//1 0, TESTI200621 20//1 , 
TESTI20063330//1 , TESTI2006341 0//1 , TESTI20068530//2, TESTI20070400//1 0, TESTI20070740//1 , 
TESTI20073460//1, TESTI20095200//6, TESTI20095880//10, TESTI20100090//1, TEST 1201 0239 0//2, 

5 TESTI20113940//3, TESTI20121040//1 , TESTI20149880//3, TESTI20151800//3, TESTI201 62780//4, 
TESTI201 701 70//1 , TESTI201 73050//1 , TESTI201 861 1 0//1 1 , TESTI201 98540//2, TESTI201 991 1 0//1 , 
TESTI20202830//1 , TESTI20204260//1 , TESTI20214630//2, TESTI20244730//1 , TESTI20245860//1 , 
TESTI2024641 0//1 , TESTI20257910//1 , TESTI20260640//1 , TESTI20261 040//4, TESTI202621 50//11 , 
TESTI20262940//1 , TESTI2026491 0//1 , TESTI20271 790//9, TESTI20278280//3, TESTI20282420//2, 

10 TESTI20282900//1 , TESTI20286590//1 , THYMU20007020//3, THYMU2001 2020//1 , THYMU2001 7270//2, 
THYMU20020800//4, THYMU20025480//3, THYMU20030690//7, THYMU20034790//2, THYMU20046350//1 , 
THYMU20050010//5, THYMU20054800//2, THYMU20055740//1 , THYMU200 62770// 1 , THYMU20078240//1 , 
THYMU20079690//2, THYMU20083390//3, THYMU20087270//1 , THYMU201 00940//4, THYMU201 06990//1 , 
THYMU20115380//2, THYMU20137050//3, THYMU20137570//1 , THYMU20143230//2, THYMU20145990//3, 

15 THYMU20150190//2, THYMU20153210//1 0, THYMU20154790//3, THYMU20163600//2, THYMU20171580//1 , 
THYMU20176010//1, THYMU201 78440//1 ( THYMU20185470//2, TRACH20011540//2, TRACH20021380//4, 
TRACH20073990//1, TRACH20081270//2, TRACH20149720//1 , TRACH201 497401/1 , TRACH201 63470//11 , 
TRACH201 66330//3, TRACH201 67090//2, TRACH201 73680//8, UMVEN1 0001 380//2, UTERU20035770//1 , 
UTERU20045200//5, UTERU20064120//7, UTERU20086530//2, UTERU20089300//1 , UTERU200951 00//3, 

20 UTERU20099040//5, UTERU20103200//3, UTERU20125810//1 , UTERU20127150//8, UTERU20139760//1 , 
UTERU20188840//2 

[0223] The Names of clones whose deduced amino acid sequences were detected to have functional domains with 
Pfam, and the name of hit functional domains are as follows. The search resuft is indicated as "clone name//functional 
domain name". When the clone has multiple hit functional domains, they are listed and demarcated by a double slash 
25 mark (//). When the clone has multiple hits of an identical functional domain, each is listed without abridgment. 

ADRGL20020290//Sodium:galactoside symporter family//Nucleoside transporter ADRGL20021910//lmmunoglobulin 
domain 

ADRGL20026790//PWWP domain 

ADRGL20036840//Class I Histocompatibility antigen, domains alpha 1 and 2 
30 ADRGL20047770//ATP synthase (F/14-kDa) subunit 
ADRGL20059610//0-Glycosyl hydrolase family 30 

ADRGL20062330//Spectrin repeatZ/Spectrin repeat//Bacterial flagellin N-terminus//Spectrin repeat//Spectrin repeat// 
Spectrin repeat 

ADRGL20066770//Collagen triple helix repeat (20 copies)//C1q domain 
35 ADRGL20079060//Transglutaminase-like superfamily 

ASTRO20006530//lntermedie filament proteins//Myc leucine zipper domain 
ASTRO20010010//Photosynthetic reaction center protein 
ASTRO2001 0290//PHD-finger 

ASTRO20026320//Viral (Superfamily 1) RNA helicase//Heavy-metai-associated domain//Viral (Superfamily 1) RNA 
helicase 

ASTRO20038400//Homebox domairV/Common central domain of tyrosinase//Rhabdovirus nucleocapsid protein// 
Homeobox domain//Homeobox domain//Homeobox domain 
ASTRO20046280//MutT-like domain 

ASTRO20050810//FGGY family of carbohydrate kinases//FGGY family of carbohydrate kinases 
45 ASTRO20052420//RhoGEF domain//PH domain//SH3 domain 

ASTRO20053430//FERM domain (Band 4.1 family)//FERM domain (Band 4.1 family)//Delta-aminolevulinic acid dehy- 
dratase//Lipoprotein amino terminal region 
ASTRO20055570//Prion protein 

ASTRO20056930//Aldehyde oxidase and xanthine dehydrogenase, Cterminus//Zinc finger, C3HC4 type (RING finger) 
50 ASTRO20085080//WD domain, G-beta repeat//Fibrillarin//WD domain, G-beta repeat//WD domain, G-beta repeat// 
WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
ASTRO20088950//Glycosyl hydrolase family 1 

BGGI120010750//Phosphoglucose isomerase//Ribosomal protein L7Ae 

BNGH420015760//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
55 G-beta repeat//WD domain, G-beta repeat 

BNGH420024870//C2 domain//C2 domain//C2 domain 

BNGH420035290//Zinc finger, C3HC4 type (RING finger)//TRAF-type zinc finger//Hr1 repeat motif//WD domain, G- 
beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta 
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repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
BNGH420036410//Arsenical pump membrane protein 
BNGH420046790//lmmunoglobulin domain 
BNGH4200523507/Urotensin II 

BNGH420059680//NHL repeat//NHL repeat//NHL repeat//NHL repeat 

BNGH420070370//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

BNGH420074600//RNA polymerase beta subunit 

BNGH420077980//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main//lmmunoglobulin domairV/lmmunoglobulin domain 
BNGH420086030//PH domain//RhoGAP domain 

BRACE20003310//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRACE20007330//BTB/POZ domain//Kelch motif//Kelch motif//Keich motif//Kelch motif//Kelch motif//Kelch motif 
BRACE20014450//alpha/beta hydrolase fold 
B RACE20050870//D E AD/D E AH box helicase 

BRACE20051600//Reverse transcriptase (RNA-dependent DNA polymerase) 

BRACE20051930//MAM domain. 

BRACE200546007/von Willebrand factor type D domain 

BRACE20055560//WD domain, G-beta repeat//WD domain, G-beta repeat 

BRACE2005981 OV/TSC-22/dip/bun family 

BRACE20061620//SPRY domain 

BRACE20065470//Ubiquitin family 

BRACE20069000//CLN3 protein 

BRACE20079200//von Willebrand factor type D domain 

BRACE20099070//FYVE zinc finger 

BRACE201961807/HMG (high mobility group) box 

BRACE20204670//Protein-tyrosine phosphatase//Dual specificity phosphatase, catalytic domain//Fatty acid desatu- 
rase//Protein-tyrosine phosphatase 

BRACE20215410//lmidazoleglycerol-phosphate dehydratase//UvrD/REP helicase 
BRAMY20001510//Zinc finger, C3HC4 type (RING fingerr//PHD-finger 

BRAMY20003540//PH domain//EF hand//EF hand//Viral RNA dependent RNApolymerase//Phosphatidylinositol-spe- 
cific phospholipase C, X domain//Phosphatidylinositol-specific phospholipase C, Y domain//Bleomycin resistance pro- 
tein/A^ domain 

BRAMY20005080//Ubiquitin carboxyl-terminal hydrolase family 2 
BRAMY20013670//S-adenosylmethionine synthetase 
BRAMY20016780//Proprotein convertase P-domain 
BRAMY20023640//UBX domain 
BRAMY20027990//C2 domain 

BRAMY20028620//Quinolinate phosphoribosyl transferase 
BRAMY20035380//Cation efflux family 
BRAMY20035830//BTB/POZ domain/ZThymidylate synthase 

BRAMY20038980//Granulocyte-macrophage colony-stimulating factor//Borrelia outer surface protein E and F 
BRAMY20040580//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

BRAMY20043630//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine 
Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
BRAMY20044920//Ubiquitin carboxyl-terminal hydrolase family 2 
BRAMY20045420//Domain found in Dishevelled, Egl-10, and Pleckstrin 
BRAMY20051820//C2 domain 
BRAMY20056620//Ca transferase domain 
BRAMY20056840//PWWP domain 

BRAMY20076100//Ligand-binding domain of nuclear hormone receptor 
BRAMY20089770//ATP P2X receptor 

BRAMY20091230//Mitochondrial carrier proteins//Mitochondrial carrier proteins 

BRAMY20094890//SURF4 family 

BRAMY20102900//Ephrin 
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BRAMY20111780//Zinc finger, C2H2 type//Transcription factor S-ll (TFI I S)//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C3HC4 type (RING finger)//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type 

BRAMY20117670//DnaJ central domain (4 repeats)//DnaJ C terminal region 
BRAMY20118410//Phosphofipase D. Active site motif 
BRAMY20118490//FGGY family of carbohydrate kinases 
BRAMY201 25360//Asparaginase 
BRAMY20134050//Nucleosome assembly protein (NAP) 
BRAMY20143870//Peptidyl-tRNA hydrolase 

BRAMY20152510//Protein-tyrosine phosphatase//Dual specificity phosphatase, catalytic domain 
BRAMY20158550//EF hand//EF hand 

BRAMY20206340//WD domain, G-beta repeat//Dockeiin domain type I//WD domain, G-beta repeat//WD domain, G- 

beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

BRAMY20227860//lnsulin-like growth factor binding proteins//Spectrin repeat 

BRAMY20234820//Ribosomal L25p family 

BRAMY20238630//TPR Domain//TPR Domain//TPR Domain 

BRAMY20244490//Adenylate kinase 

BRAMY20245140//Cyclic nucleotide-binding domain 

BRAMY20251210//Ephrin receptor ligand binding domain//EB module 

BRAMY20263000//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
BRAMY2027451 0//Ribosomal protein L11 

BRAWH20014590//RNA recognition motif, (a. k. a. RRM, RED, or RNP domain) 
BRAWH20021 91 0//Carboxylesterases 

BRAWH20026010//Hepatitis C virus RNA dependent RNA polymerase 

BRAWH20039640//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine rich repeat C-terminal domain// 
Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine rich repeat C-terminal domain 
BRAWH20040680//Zinc finger, C3HC4 type (RING finger)//DENN (AEX-3) domain 
BRAWH20047790//HMG (high mobility group) box 
BRAWH20050740//BTB/POZ domain 

BRAWH20080580//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
DnaJ central domain (4 repeats)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRAWH20093040//Eukaryotic protein kinase domain 
BRAWH20094900//BNR repeat//BNR repeaV/BNR repeat 

BRAWH20095900//KRAB boxZZZinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRAWH20183170//Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
BRAWH201 85260ZZTroponinZ/Serine hydroxymethyltransferase 
BRAWH201 85270//Uncharacterized protein family UPF0057 
BRAWH20188750//Glypican//ubiE/COQ5 methy [transferase family 
BRAWH20191980//Proline dehydrogenase 
BRCAN10001050//Cell division protein 

BRCAN20005410//Zinc finger, C4 type (two domains)//Zinc finger, C2HC type//SAM domain (Sterile alpha motif)// 
Sterile alpha motif (SAM)/Pointed domain 
BRCOC20000470//TPR Domain//TPR Domain 
BRSSN20005610//PDZ domain (Also known as DHR or GLGF). 
BRSSN20005660//Bacterial type II secretion system protein 

BRSSN20066440//SCAN domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BRSSN20074640//Peptidase family M48 
BRSSN20093890/ZKelch motifZZKelch motif 

CD34C20001 750ZZImmunoglobulin domainZZImmunoglobulin domain 

CTONG20004110ZZAnk repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Tropomyosins//Transient receptor// 
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Apolipoprotein A1/A4/E family//MutS family, N-terminal putative DNA binding domain//K-box region//Outer membrane 
efflux protein 

CTONG20004520//SH3 domain 

CTONG20007660//Caspase recruitment domain//DNA polymerase (viral) N-terminal domain//bZIP transcription fac- 
tor//K-box region 

CTONG20008190//ADP-ribosylation factor family//Ras family 
CTONG20017490//Sema domain//Sema domain//Integrins, beta chain//Plexin repeat 

CTONG20020950//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
CTONG20029030//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine 
Rich Repeat//Leucine Rich RepeatZ/Leucine Rich RepeatZ/Leucine Rich Repeat 

CTONG20030280//WD domain, G-beta repeat//Gram-negative pili assembly chaperone//WD domain, G-beta repeat// 
WD domain, G-beta repeat//WD domain, G-beta repeat/ZWD domain, G-beta repeat//WD domain, G-beta repeat 
CTONG20032930//Armadillo/beta-catentn-like repeats 
CTONG20033610//RyR domain 

CTONG20033750//FtsK/SpolllE family//ATPases associated with various cellular activities (AAA) 
CTONG20036990//lmmunoglobulin domain//lmmunoglobulin domain 
CTONG20044230//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
CTONG20044870//PH domain//PH domain 

CTONG20045500//Translation initiation factor IF-3//HC03- transporter family//HC03- transporter family//Domain of 
unknown function DUF139 
CTONG20046690//Src homology domain 2 

CTONG20049480//bZIP transcription facto r//Carbamoyl -phosphate synthase (CPSase)//tRNA synthetases class I (C) 
CTONG20060040//NusB family 
CTONG20063770//KE2 family protein/ZSpectrin repeat 

CTONG20063930//SH3 domain//WW domain//WW domain//PH domain//RhoGAP domain 
CTONG20066110//TPR Domain//TPR Domain 

CTONG20068360//Mitochondrial carrier proteins//Mitochondrial carrier proteins 
CTONG20069420//Ribosomal protein S14p/S29e 
CTONG20070090//Bacterial luciferase 

CTONG20070720//PH domain//RhoGAP domain//bZIP transcription factor 
CTONG20070910//PCI domain 

CTONG20071040//Beta/Gamma crystallin//Beta/Gamma crystailin//Beta/Gamma crystallin//Beta/Gamma crystallin// 
Similarity to lectin domain of ricin beta-chain, 3 copies. 

CTONG20071 680//3'-5' exonuclease//Cytochrome c oxidase subunit lll//Ammonium Transporter Family//7 transmem- 
brane receptor (Secretin family) 

CTONG20072930//KRAB box//Ribosomal protein L20//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
CTONG20074000//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

CTONG20074170//DENN (AEX-3) domain 
CTONG20076810//Ribosomal protein L1 6//Pheromone A receptor 
CTONG20079590//Sialyltransferase family 
CTONG20083430//Nuclear transition protein 2 
CTONG20083980//WH1 domain 
CTONG20084660//SCAN domain 
CTONG20085210//Lipase (class 3) 

CTONG20165750//G-patch domain//Double-stranded RNA binding motif 
CTONG20169040//bZIP transcription factor//Adenylate cyclase//! ntermediate filament proteins 
GTONG20170940//Ank repeat//Ank repeat//Ank repeat//SAM domain (Sterile alpha motif) 
CTONG20174580/ZTBC domain 

CTONG201 76040//ADP-ribosyiation factor family//Ras family 
CTONG20180690//Collagen triple helix repeat (20 copies) 

CTONG20183430//Ank repeat//Ank repeatZ/Ank repeatZ/Ank repeat//Ank repeat//Ank repeaV/Ank repeat//Ank repeat// 
Zinc finger, C3HC4 type (RING finger)//Zinc finger, C3HC4 type (RING finger)//Zinc finger, C3HC4 type (RING finger) 
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CTONG20184830//ABC transporter 
CTONG20186290//Aldehyde dehydrogenase family 

CTONG20186370//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type 

5 CTONG20186520//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
CTONG201 86550//BTB/POZ domain//K+ channel tetramerisation domain 

10 CTONG201 88080//TPR Domain 

CTONG20189000//RhoGEF domain 

CTONG20190290//R3H domain//RNA dependent RNA polymerase//Uncharacterized protein family UPF0024 

DFNES200255007/Sigma-54 interaction domain//ATPases associated with various cellular activities (AAA) 

DFNES20043710//Src homology domain 2//Domain of unknown function DUF36 
15 DFNES20055400/A/iral coat proteirvVPutative diphthamide synthesis proteirV/lnfluenza non-structural protein (NS1 ) 

DFNES20057660//Plant thionins//Mitochondrial carrier proteins//Mitochondrial carrier proteins 

DFNES20072990//lntegral membrane protein DUF6// Integral membrane protein DUF6 

DFNES20073320//Zinc finger, C3HC4 type (RING finger)//PHD-finger//B-box zinc finger. 

DFNES20076340//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
20 DFNES200808807/Glycosyl transferases//Similarity to lectin domain of ricin beta-chain, 3 copies. 

DFNES20094820//PHD-finger//Zn-finger in ubiquitin-hydrolases and other proteins 

FCBBF10000230//Sec7 domain//PH domain 

FCBBF10004760//HMG (high mobility group) box 

FCBBF20018680//C2 domain//C2 domain 
25 FCBBF20020440//Urease 

FCBBF20023490//Helicases conserved C-terminal domain 

FCBBF20033360//BTB/POZ domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 
FCBBF20035430//AN1 -like Zinc finger//AN1 -like Zinc finger 
FCBBF20035490//KH domain 
30 FCBBF20041 380//SAM domain (Sterile alpha motif) 
FCBBF20043730//UBA domain 
FCBBF20059660//TPR Domain 
FCBBF20066340//PH domain 

FCBBF20070950//DNA binding domain with preference for A/T rich regions 
35 FCBBF30001 1 00//DENN (AEX-3) domain//PPR repeat 
FCBBF30002270//linker histone H1 and H5 family 

FCBBF30003610//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Plant PEC family metal- 
loth ionein//DM DN A binding domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, 
C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Phorbol esters/diacylglycerol binding domain 
40 (C1 domain) 

FCBBF30004340//Ribosomal protein S3, C-terminal domain .//Similarity to lectin domain of ricin beta-chain, 3 copies. 
FCBBF30005360//Sigma-54 interaction domairvVATPases associated with various cellular activities (AAA)//ATPases 
associated with various cellular activities (AAA) 

FCBBF30005500//PH domairvVPH domairvVPutative GTP-ase activating protein for Arf 
45 FCBBF30019140// , chromo' (CHRromatin Organization Modifier) domain// , chromo' (CHRromatin Organization Modifi- 
er) domain//D EAD/D E AH box helicase//SNF2 and others N-terminal domain//Helicases conserved C-terminal domain 
FCBBF30019180//Armadillo/beta-catenin-like repeats//Lipo protein amino terminal region 

FCBBF30021900//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//BolA-like protein//Zinc finger, C2H2 type//Zinc finger, C2H2 
50 type//Snake toxin//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type FCBBF30022680//3 , -5 , ex- 
onuclease 

FCBBF30029250//SET domain 
FCBBF30035570//C2 domain 
55 FCBBF30048420//T-box//wnt family of developmental signaling proteins 

FCBBF30071500//lnfluenza RNA-dependent RNA polymerase subunit PB1//Reprolysin family propeptide//Leptin 
FCBBF30076310//Eukaryotic protein kinase domain//Eukaryotic protein kinase domain//Protein kinase C terminal do- 
main 
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FCBBF30079770//D-isomer specific 2-hydroxyacid dehydrogenases 
FCBBF30080730//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//Zinc knuckle 
FCBBF30085560//Pyridine nucleotide-disulphide oxidoreductase//FAD binding domain//Flavin containing amine oxi- 
dase//Phytoene dehydrogenase related enzyme 
5 FCBBF300931 70//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 
FCBBF30100080//Sec7 domain 

FCBBF30100120//PD2 domain (Also known as DHR or GLGF). 
FCBBF30100410//Nucleosome assembly protein (NAP) 
10 FCBBF30118670//Reprolysin (M12B) family zinc metalloprotease//Disintegrin//EB module//Hantavirus glycoprotein 
G2//Adenovirus E3 region protein CR2//jmjN domain 
FCBBF30125460//Zinc finger, C3HC4 type (RING finger) 

FCBBF30129010//KRAB box//Zinc finger, C2H2 type//GATA zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type 
15 FCBBF301 3041 0//UvrB/uvrC motif 

FCBBF30132050//Galactosyltransferase//Fringe-iike 

FCBBF30132660//Leucine rich repeat N-terminal domain//Leucine Rich RepeaV/Leucine Rich RepeatZ/Leucine Rich 
Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeaWLeucine Rich Repeat 
FCBBF30136230//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type/VZinc finger, C2H2 type// 
20 Cystatin domain//Homeobox domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
FCBBF301 42290//PHD-f inger 

FCBBF301 43550//Phosphatidylinositol-4-phosphate 5-Kinase 

FCBBF30153170//Phosphofructokinase//Phosphofructokinase//Phosphofructokinase//Phosphofructokinase 

FCBBF30161780//gag gene protein p24 (core nucleocapsid protein)//Zinc knuckle 
25 FCBBF30164510//Cadherin domain//Cadherin domain//Cadherin domain//Fructose-bisphosphate aldolase class-l// 

Cadherin domain//Cadherin domain//Alphaherpesvirus glycoprotein E//Cadherin cytoplasmic region 

FCBBF301 66220//Serine hydroxymethyltransf erase 

FCBBF301 69280//PHD-finger//Zinc finger, C3HC4 type (RING finger) 

FCBBF30171230//Subtilasefamily//Proprotein convertase P-domain 
30 FCBBF30173960//Beta-lactamase//TPR Domain 

FCBBF30194550//Arsenical pump membrane protein//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 

Dehydrogenase E1 component//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat 

FCBBF30195690//C2 domain 

FCBBF301 97840//Sushi domain (SCR repeat)//CUB domaiiV/Sushi domain (SCR repeat)//CUB domain//Sushi domain 
35 (SCR repeat)//Sushi domain (SCR repeat)//Sushi domain (SCR repeat) 

FCBBF3021 221 0//Immunoglobulin domain 

FCBBF30215240//PH domain//FERM domain (Band 4.1 family) 

FCBBF30220050//Ligand-binding domain of nuclear hormone receptor 

FCBBF3022291 0//Corticotropin-releasing factor family 
40 FCBBF30236670//DEAD/DEAH box helicase 

FCBBF30250980//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 

G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

FCBBF30255680//Vesicular monoamine transporter//GGL domain//Ezrin/radixin/moesin family//Ank repeat//Ank re- 
peat 

45 FCBBF30257370//Geminivirus AL1 protein//Outer membrane efflux protein 

FCBBF30259050//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger present in dys- 
trophin, CBP/p300//Zinc finger, C2H2 type//PHD-f inger/ZZinc finger, C2H2 type//Zinc finger, C2H2 type//DM DNA bind- 
ing domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcrip- 
tion factor S-ll (TFI IS)//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 

50 Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

FCBBF30260210//SNF2 and others N-termina domain//Helicases conserved C-terminal domain//Domain of unknown 
function DUF94 

FCBBF30263080//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
FCBBF30266510//Domain of unknown function DUF71 
55 FCBBF30271 990//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 
Ank repeat//Ank repeat//Ank repeat//Ank repeat 
FCBBF30275590//LIM domain containing proteins 

FCBBF30282020//Ank repeat//Ank repeat//Ank repeat//K+ channel tetramerisation domain//BTB/POZ domain 
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FCBBF30285930//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

FCBBF50001650//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif// 
MORN motif 

FCBBF50003530//Zinc finger, C3HC4 type (RING finger) 
FCBBF50004950//Putative replicase 1 (ORF2) 

FEBRA20005040//lntermediate filament proteins//Hr1 repeat motif//Troponin//GrpE 
FEBRA20007820//DNA polymerase family B 

FEBRA20018670//Viral methyltransferase//Riboso protein 51 9//Alpha-2-macroglobulin family N-terminal region 
FEBRA20026820//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

FEBRA20027070//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Tran script ion factor S-ll (TFIIS)//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type 

FEBRA20029620//Leucine Rich Repeat//Leucine Rich RepeaV/Leucine Rich Repeat 

FEBRA20031000//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//lnsulin/IGF/Relaxin family//Ribosomal RNA adenine dimethylases//SAM domain 
(Sterile alpha motif)//TFIIE alpha subunit/ZZinc finger, C3HC4 type 
(RING finger) 

FEBRA20031280//Protein of unknown function DUF82 
FEBRA20038330//Corticotropin-releasing factor family 

FEBRA20038970//Laminin EGF-like (Domains III and V)//Pho esters/diacy I glycerol binding domain (C1 domain)//EGF- 
like domain//EGF-like domain//Trypsin Inhibitor like cysteine rich domain//Metallothionein//EGF-like domain//EGF-like 
domain//Extracellular link domain 

FEBRA20046200//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat 
FEBRA2004651 0//Zinc finger, C2H2 type//Zinc finger, C2H2 type//BolA-like protein//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type 

FEBRA20063720//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

FEBRA20078800//NADH ubiquinone oxidoreductase, 20 Kd subunit 
FEBRA20080860//Hantavirus glycoprotein G2 

FEBRA20087550//WD domain, G-beta repeat//WD domain, G-beta repeat 
FEBRA20088610//CRAUTRIO domain. 
FEBRA20088810//Fibroblast growth factor 
FEBRA20090160//Nuclear transition protein 2 

FEBRA20090220//Nucleotidyl transferase//Bacterial transferase hexapeptide (four repeats)//Bacterial transferase 
hexapeptide (four repeats)//Bacterial transferase hexapeptide (four repeats)//Domain of unknown function DUF29// 
Peptide hormone//elF4-gamma/elFS/elF2-epsilon 

FEBRA20092760//LIM domain containing proteins//LIM domain containing proteins//LIM domain containing proteins 
FEBRA20115930//Divalent cation transporter/ZTranslation initiation factor IF-3//Divalent cation transporter 
FEBRA20150420//Transient receptor//Borrelia ORF-A//Transient receptor 

FEBRA201 70240//SCAN domain//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type 

FEBRA201 72230//Phospholipase D. Active site motif//Phospholipase D. Active site motif 
FEBRA201 73330//Eukaryotic protein kinase domain 
FEBRA201 75330//D-isomer specific 2-hydroxyacid dehydrogenases 
FEBRA20191720//GGL domain//Regulator of G protein signaling domain 

HCHON1 00001 50//LIM domain containing proteins//LIM domain containing proteins//LIM domain containing proteins// 
Zinc finger, C3HC4 type (RING finger) 
HCHON20000870//Eukaryotic protein kinase domain 

HCHON20002650//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
HCHON20002710//Ubiquitin carboxyl-terminal hydrolase family 2 
HCHON20015050//von Willebrand factor type A domain//! PT/TIG domain//Sm protein 
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HEART10001420//MYND finger//SET domain 
HEART10001490//FAD binding domain 
HEART20009590//Peptidase family M41 

HEART2001931 0//Zinc finger, C3HC4 type (RING finger)//PHD-finger//B-box zinc finger. 

5 HEART20022200//lnfluenza Matrix protein (M 1 )//metallopeptidase family M24 

HEART20047640//3'5 , -cyclic nucleotide phosphodiesterase//Formin Homology 2 Domain 
HEART20082570//C-5 cytosine-specific DNA methylase//Glycine cleavage T-protein (aminomethyl transferase) 
HHDPC20081230//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 
RBD, or RNP domain)//RNA recognition motif, (a. k. a. RRM, RBD, or RNP domain)//RNA recognition motif, (a. k. a. 

10 RRM, RBD, or RNP domain) 

HHDPC20088160//PDZ domain (Also known as DHR or GLGF). 
HLUNG20008460//bZIP transcription factor 
HLUNG20011260//Eukaryotic protein kinase domain 

HLUNGZOOI1460//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 
15 RBD, or RNP domain) 

HLUNG20014590//Zinc finger, C2H2type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
HLUNG20015070//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
20 RepeaWLeucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine rich repeat 
C-terminal domain//lmmunoglobulin domain 
HLUNG20024050//Rubredoxin 

HLUNG20028110//Zinc finger, C3HC4 type (RING finger)//TPR Domain//TPR Domain//TPR Domain//Zinc finger, 
C3HC4 type (RING finger)//Aldo/keto reductase family 
25 HLUNG20030420//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat 

HLUNG20032460//Pro oligopeptidase family//Lipoate-protein ligase B//alpha/beta hydrolase fold 
HLUNG20033060//HMG (high mobility group) box 

HLUNG20041590//TPR Domain//TPR Domain//Domain of unknown function DUF27 
HLUNG20042730//Cytochrome P450 
30 HLUNG20051330//FHIPEP family 

HLUNG20063700//Progesterone receptor 

HLUNG20065990//bZIP transcription factor//SNAP-25 family//Syntaxin 
HLUNG20068120//Fimbrial Usher proteins 
HLUNG20069350//EF hand//EF hand//EF hand 
35 HLUNG2007041 0//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
HLUNG20081390//DnaJ domain 
HLUNG20082350//PH domain//START domain 

HLUNG20083330//DNA polymerase III beta subunit//Fibrillar collagen C-terminal domain 
HLUNG20084790//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

40 HLUNG20085210//C2 domain 

KIDNE20016360//PAS domain//PAC motif//lon transport protein/zTransmembrane region cyclic Nucleotide Gated 
Channel//Cyclic nucleotide-binding domain KIDNE20027980//SAM domain (Sterile alpha motif) 
KIDNE20080690//Glycosyl hydrolases family 1 8//Aminotransferases class-Ill pyridoxal-phosphate//Methyl-accepting 
chemotaxis protein (MCP) signaling domain//Aminotransferases class-Ill pyridoxal-phosphate 

45 KIDNE20081170//Kinesin motor domain//Kinesin motor domain 
KIDNE20083620//Asparaginase 
KIDNE20084030//Sugar (and other) transporter 

KIDNE20084730//Neuraxin and MAP1B proteins//Formin Homology 2 Domain 
KIDNE20086490//gag gene protein p24 (core nucleocapsid protein) 
50 KIDNE20087880//Reverse transcriptase (RNA-dependent DNA polymerase) 
KIDNE20088240//EF hand//EF hand//EF hand 

KIDNE20089870//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat/VWD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

KIDNE20094670//Sigma-54 interaction domain//FtsK/SpolllE family//ATPases associated with various cellular activi- 
55 ties (AAA) 

KIDNE20133880//G-protein alpha subunit 
KIDNE20141700//Ribosomal family S4e 

KIDNE20142900//EGF-like domain//EB module//EGF-like domain//EGF-like domain 
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KIDNE20148080//Zinc finger, C3HC4 type (RING finger) 
KIDNE20149780//Ank repeat//Ank repeat//Ank repeat 
KIDNE20152440//Trypsin//PDZ domain (Also known as DHR or GLGF). 

KIDNE20154330//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF)7/PDZ 
5 domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as 
DHR or GLGF)7/PDZ domain (Also known as DHR or GLGF). 
KIDNE20160360//Sec7 domain//PH domain 

KIDNE20169180//EGF-like domain//EB module//EGF-like domain/ZTrypsin Inhibitor like cysteine rich domain//EGF- 
like domain//TNFR/NGFR cysteine-rich region//Zona pellucida-like domain 
10 KIDNE20170400//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Protein kinase C terminal do- 
main//Rubredoxin 

KIDNE201 731 50//Acetyltransf erase (GNAT) family 

KIDNE20173430//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF). 
KIDNE20186170//UDP-glucoronosyl and UDP-glucosyl transferases 
15 KIDNE20189960//Trehalase 

LIVER20006260//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Plant PEC family metallothionein//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

LIVER20007750//Sugar (and other) transporter 
20 LIVER2001 0760//Lectin C-type domain 

LIVER20010990//DNA gyrase/topoisomerase IV, subunit A 

LIVER20013890/ARNA synthetases class I (C) 

LIVER20026440//Cytochrome P450 

LIVER20030650//General diffusion Gram-negative porins 
25 LIVER20038000//Mitochondrial carrier proteins//Mitochondrial carrier proteins//Mitochondrial carrier proteins 

UVER20040740//CRAL/TRIO domain. 

LIVER20055270//AIR synthase related protein 

MESAN20006200//Annexin//Annexin 

MESAN20008150//IBR domain 
30 MESAN20009090//CUB domain 

MESAN20016270//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 

C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 

C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

MESAN20021130//SH3 domain//Eukaryotic protein kinase domain 
35 MESAN20021220//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 

Rich RepeatZ/Leucine Rich Repeat//Leucine rich repeat C-terminal domain//Leucine rich repeat N-terminal domain// 

Leucine Rich Repeat//Leucine Rich RepeaV/Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leu- 

cine rich repeat C-terminal domain 

MESAN20026870//PAN domain//TBC domain 
40 MESAN20027240//RhoGEF domain 

MESAN20027900//von Willebrand factor type A domain//von Willebrand factor type A domain//von Willebrand factor 

type A domain//Protozoan/cyanobacterial globin//von Willebrand factor type A domain//von Willebrand factor type A 

domain 

MESAN20030350//LGN motif, putative GEF specific for G-alpha GTPase//DNA gyrase/topoisomerase IV, subunit A 
45 MESAN20030370//Porphobilinogen deaminase//GHMP kinases putative ATP-binding proteins//Protein of unknown 
function DUF113 

MESAN20034440//Viral DNA-binding protein/AJncharacterized protein family UPF0024 

MESAN20038520//Caspase recruitment domain//RNA polymerase beta subunit 

MESAN20057240//Elongation factor TS//Helix-hairpin-helix motif. 
50 MESAN2005B110//FKBP-type peptidyl-prolyl cis-trans isomerases//Eiongation factor Tu family//FKBP-type peptidyl- 

prolyl cis-trans isomerases//FKBP-type peptidyl-prolyl cis-trans isomerases 

MESAN20059570//EGF-like domain//SEA domain//lmmunoglobulin domain 

MESAN20060430//SET domain 

MESAN20067430//T ropomyosins 
55 MESAN20069530//Calponin homology (CH) domain//PDZ domain (Also known as DHR or GLGF). 

MESAN20090190//EGF-like domain//EGF-like domain//EGF-like domain//EGF-like domain//EGF-like domain//EGF- 

like domain//Metallothionein//CUB domain 

MESAN20095220//Adaptin N terminal region 
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MESAN20095800//Cyclophilin type peptidyl-prol cis-trans isomerase//RNA recognition motif, (a.k.a. RRM, RBD, or 
RNP domain) 

NT2NE20018890//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat 

NT2NE20026200//Transketolase 
NT2NE20026510//Zinc finger, C3HC4 type (RING finger) 

NT2NE20038870//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Lipoprotein 

NT2NE20042550/A/iral (Superfamily 1) RNA helicase//NB-ARC domain//Adenylate kinase//Adeny!ate kinase 
NT2NE20053950//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

NT2NE20060750//KRAB box//Zinc finger, C2H2 type 
NT2NE20061030//SCAN domain 

NT2NE20077270//Adenovirus EB1 55K protein / large t-antigen 

NT2NE20079670//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
NT2NE20087270//Tudor domain 
NT2NE20092950//Fibronectin type Hi domain 

NT2NE20108420//Domain of unknown function DUF1307/Oxysterol-binding protein 
NT2NE20117580//NADH ubiquinone oxido reductase, 20 Kd subunit 
NT2NE20127900//Myo-inositol-1 -phosphate synthase 
NT2NE20140130//Sema domain 
NT2NE20145250//Stathmin family 

NT2NE20153620//FERM domain (Band 4.1 family)//PH domain 
NT2RI2009301 07/Tetrahydrofolate dehydrogenase/cyclohydrolase 
NT2RP70001120//Ank repeat//Ank repeat//Ank repeat//! nsulinase (Peptidase family M16) 
NT2RP70001730//BTB/POZ domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 
NT2RP70003110//Collagen triple helix repeat (20 copies)//Heavy-metal-associated domain 

NT2RP70027790//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat 

NT2RP70029780//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2type//PHD-finger//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll 
(TFIIS) 

NT2RP70030840//Viral (Superfamily 1) RNA helicase 
NT2RP70031 070//Nucleoside transporter 
NT2RP70031 340//PHD-finger 

NT2RP70046410//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

NT2RP70056690//Thrombospondin type 1 domain//Domain of unknown function DUF18//Thrombospondin type 1 do- 
main//Thrombospondin type 1 domain//Thrombospondin type 1 domain//Thrombospondin type 1 domain//Keratin, high 
sulfur B2 protein 

NT2RP70057500//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DnaJ central do- 
main (4 repeats)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

NT2RP70064570//Calpain family cysteine protease//Calpain large subunit, domain III//EF hand//EF hand 
NT2RP70075300//KRAB box//Domain of unknown function DUF19//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

NT2RP70075800//recA bacterial DNA recombination proteins//WD domain, G-beta repeat//WD domain, G-beta re- 
peat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat// 
SH3 domain 

NT2RP70090870//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DM DNA binding domain//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcrip- 
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tion factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
NTONG2000531 0//Ribosomal protein S9/S1 6 
NTONG20029850//EF hand//EF hand//EF hand 
NTONG20031580//Hsp20/alpha crystallin family 
NTONG20032100//lntermediate filament proteins 

NTONG20034540//GAF domain//GAF domain//3'5'-cyclic nucleotide phosphodiesterase 
NTONG20035150//BTB/PO2 domain//Kelch motif//Kelch motif//Kelch motif//Kelch motif//Kelch motif 
NTONG20043080//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain 
NTONG20048440//PH domain//K-box region 

NTONG20053630//DNA binding domain with preference for A/T rich regions//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Progesterone receptor//Zinc finger, C2H2 type 
NTONG20053730//Ribosomal protein 52//Ubiquitin carboxyl-terminal hydrolases family 2//Ubiquitin carboxy I -terminal 
hydrolase family 2 

NTONG200539107/Translin family//PH domain//Putative GTP-ase activating protein for Arf//Ank repeat//Ank repeat 
NTONG20055200//Calcium channel extracellular region//Elongation factor Tu family//E!ongation factor G C-terminus 
NTONG2005801 0//AMP-binding enzyme 

OCBBF20000740//Leucine rich repeat N-terminal domain//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
RepeatZ/Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine rich repeat C-terminal domain//lmmunoglobulin domain 
OCBBF20011860//LIM domain containing proteins 

OCBBF20012520//Leucine rich repeat N-terminal domainZZLeucine Rich Repeat//Leucine Rich Repeat//Leucine Rich 
Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine rich repeat C-terminal domainZZImmunoglobulin domain 
OCBBF20016390//Chitin synthase//Fibronectin type II domain 
OCBBF20016810//'Paired box' domain 

OCBBF20110210//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain 
OCBBF20113110//AP endonuclease family 1 

OCBBF201 1 6250//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type 
OCBBF20120010//Thrombospondin type 1 domain 

OCBBF20120950//K+ channel tetramerisation domain//BTB/POZ domain/ZZinc finger, C2H2 type//Zinc finger, C2H2 
type 

OCBBF20121910//Ribosomal protein L24e 
OCBBF20147070//DNA polymerase (viral) C-terminal domain 

OCBBF201 56450//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type 

OCBBF20157970//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 typeZZZinc 
finger, C2H2 type//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//DM 
DNA binding domain//Zinc finger, C2H2 type 
OCBBF201 6591 0//DnaB-like helicase 
OCBBF20166890//CAP-Gly domain//CAP-Gly domain 

OCBBF20166900//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 typeZ/Transcription factor S-ll 
(TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

OCBBF20167290//haloacid dehalogenase-like hydrolase//Thioredoxin//Aminoglycoside phosphotransferase// 
Acyl-CoA dehydrogenase 
OCBBF201 74580//Cyclin 
OCBBF20174890//Death domain 
OCBBF20175360//Zinc finger, C2H2 type 

OCBBF20177540//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 typeZZZinc finger, C2H2 type//Src homology 
domain 2//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
OCBBF2017791 0//Corticotropin-releasing factor family 
OCBBF20182060//lnward rectifier potassium channel 

OCBBF20191950//EGF-like domain//Low-density lipoprotein receptor domain class A//Low-density lipoprotein recep- 
tor domain class A//EGF-like domain//Low-density lipoprotein receptor domain class A//EGF-like domain//Low-density 
lipoprotein receptor domain class A//Low-density lipoprotein receptor domain class MEGF-like domain//EGF domain// 
Low-density lipoprotein receptor repeat class B//Low-density lipoprotein receptor repeat class B//Low-density lipopro- 
tein receptor repeat class B//Low-density lipoprotein receptor repeat class B//Low-density lipoprotein receptor repeat 
class B//EGF-like domain//60Kd inner membrane protein 
PANCR1 0000860//Trypsin//Trypsin 

PEBLM1 0001470//Zinc finger, C2H2 type//Fork head domain 
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PEBLM20001800//lmmunoglobulin domain//lmmunoglobuIin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main//lmmunoglobulin domain 

PEBLM20003260//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Src homology do- 
main 2//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Src homology domain 2//Zinc finger, C2H2 type 
PEBLM20005020//Virion host shutoff protein 
PLACE50001390//PHD-finger 
PLACE60004260//Cystatin domain 
PLACE60006300//lmmunoglobulin domain 

PLACE60012620//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//AN1-like Zinc finger//FYVE zinc finger 
PLACE60054230//C methyltransferase//Formin Homology 2 Domain 

PLACE60054820//Regulator of chromosome condensation (RCC1 )//Regulator of chromosome condensation (RCC1 ) 
//Regulator of chromosome condensation (RCC1)//Regulator of chromosome condensation (RCC1) 
PLACE60054870//IQ calmodulin-binding motif//IQ catmodulin-binding motif 
PLACE60055590//Zinc finger, C3HC4 type (RING finger) 

PLACE60061370//Phosphotyrosine interaction domain (PTB/PID)7/Extracellular link domain 
PLACE60062660//Gamma-adaptin, C-terminus 

PLACE600641 80//Lumenai portion of Cytochrome b559, alpha (gene psbE) subunit.//Viral (Superfamily 1) RNA heli- 
case 

PLACE60066970//SCAN domain//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type 

PLACE50070500//lmmunoglobulin domain//lmmunoglobulin domain 
PLACE60073090//Myo-inositol-1 -phosphate synthase 
PLACE60074820//Adenylate kinase 

PLACE60082850//Pathogenesis-related protein Bet v I family 
PLACE60087680//Thyroglobulin type-1 repeat 

PLACE60093380//Penicillin amidase//Bacterial regulatory proteins, lad family/A/acuolar sorting protein 9 (VPS9) do- 
main 

PLACE60095600//DEAD/DEAH box helicase 
PLACE60098350//MAGE family 

PLACE60104fi30//Photosystem I reaction centre subunit VIII 

PLACE60113340//EGF-like domain//Laminin G domain//lnsulin-like growth factor binding proteins//EGF-like domain// 
Laminin G domain 

PLACE60118810//bZIP transcription factor//TPR Domain//TPR Domain//TPR Domain//TPR Domain//PPR repeat 
PLACE641 1 9700//EF hand 
PLACE60122970//Zinc finger, C2H2 type 

PLACE60138840//Syndecan domain//Mitochondrial carrier proteins//Mitochondrial carrier proteins//Mitochondrial car- 
rier proteins 

PLACE60140640//Phosphoribulokinase//Shikimate kinase//Uncharacterized protein family UPF0038 
PLACE60154450//RhoGAP domain 

PLACE60177880//lmmunoglobu!in domain//lmmunoglobulin domain 
PLACE60181870//Pentaxin family 
PLACE60184870//LBP / BPI / CETP family 
PROST10001100//Keratin, high sulfur B2 protein 

PROST20007170//KRAB box//Zinc finger, C2H2 type//Cyclopropane-fatty-acyl-phospholipid synthase/ZZinc finger, 
C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcrip- 
tion factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
PROST20015210//Myosin tail//Borrelia lipoproteins//Myosin tail 
PROST20016760// , chromo l (CHRromatin Organization Modifier) domain 

PROST20024250//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
PROST20028970//Ank repeat//Ank repeatZ/Glutamine am idotransf erases class-ll//Ank repeat//Ank repeat 
PROST20033240//Ephrin receptor ligand binding domain//EB module//TN FR/NGFR cysteine-rich region//Fibronectin 
type III domain 

PROST20036350//Glutathione S-transferases.//Ribosomal protein S24e//lnterferon alpha/beta domain/ARNA syn- 
thetases class I (E and Q) 
PROST20045700//Keratin, high sulfur B2 protein 
PROST20050390//Cytochrome P450 

PROST20051310//DEAD/DEAH box helicase/ZToprim domain//Helicases conserved C-terminal domain//Zinc knuckle 
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PROST20065790//Phosphofructokinase//Phosphofmctokinase//Phosphofmctokinase//Phosphofmctokinase 
P ROST20073280//Transposase 

PROST20075280//lmmunoglobulin domain//lmmunoglobulin domain//Thrombospondin type 1 domain 
PROST20082430//Cyclophilin type peptidyl-prolyl cis-trans isomerase 
PROST20084720//Cytochrome P450 

PROST20087240//gag gene protein p24 (core nucleocapsid protein) 
PROST20099090//Disintegrin 

PROST20102190//EF hand//Ribosomal RNA adenine dimethylases//EF hand 
PROST20l05450//Sodium/hydrogen exchanger family 
P ROST201 27450//TSC-22/dip/bun family 
PROST20130320//S-100/ICaBP type calcium binding domain 
PROST20152510//TPR Domain//TPR Domain//TPR Domain 

PROST20155370//SCAN domain//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type 
PROST20168600//KTN NAD-binding domain 
FUAEN10000650/fi"SC-22/dip/bun family 

PUAEN10001640//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
PUAEN20000800//Bleomycin resistance protein 
PUAEN20001520//Acetyltransferase (GNAT) family 
PUAEN20002470//FtsK/SpolllE family 

PUAEN20003120//Myb-like DNA-binding domain//ATP synthase ab C terminal//SET domain 
SALGL10001070//Family 4 glycosyl hydrolase 
SKMUS20006790//von Willebrand factor type D domain 
SKMUS20007260//Syndecan domain 
SKMUS20008730//Calponin homology (CH) domain 
SKMUS20017400//Hantavirus nucleocapsid protein//Tropomyosins 
SKMUS20040440//Ribosomal protein L3 
SKMUS20091 900//Trypsin 

SKNMC100012307/Ank repeat//Ank repeat//Ank repeat 

SKNMC20006350//FKBP-type peptidyl-prolyl cis-trans isomerases//FKBP-type peptidyl-prolyl cis-trans isomerases// 
FKBP-type peptidyl-prolyl cis-trans isomerases//EF hand//EF hand 
SKNSH2000971 07/T ropomyosins//T ropomyosins 

SKNSH20052400//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
SKNSH20057920//Eukaryotic protein kinase domain 
SMINT20000070//Rabphilin-3A effector domain 

SMINT20002320//Protein phosphatase 2A regulatory B subunit (B56 family) 

SMINT20006020//PH domain//Phor esters/diacylglycerol binding domain (C1 domain)//FYVE zinc finger//PH domain 

SMINT20006090//GlutathioneS-transferases. 

SMINT20008110//Na+/K+ ATPase C-terminus 

SMINT20011950//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//PHD-finger//Zinc finger, C2H2 type 
SMINT20012220//Collagen triple helix repeat (20 copies) 
SMINT20016150//gag gene protein p17 (matrix protein).//Ferrltins 
SMINT20024140//lmmunoglobulin domain 

SMINT20028800//Zinc finger, C2HC type//SAM domain (Sterile alpha motif)//Sterile alpha motif (SAM)/Pointed domain 
SMINT20028840//lmmunoglobulin domain 

SMINT20030740//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type 

SMINT20035050//SH3 domain//WW domain//PH domain//RhoGAP domain 

SMINT20036440//ENTH domain 

S M I NT20038660//O If actomed in- li ke domai n 

SMI NT20039050//Flavi virus glycoprotein//! BR domain 

SMINT20043390//Ras association (RalGDS/AF-6) domain 

SMINT20044140//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
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SMINT20044730//Envelope glycoprotein GP120 
SMINT20048720//Cytochrome P450//Cytochrome P450 
SMINT20052130//Ank repeat//Ank repeat 
SMINT20056230//lmmunoglobulin domain//lmmunoglobulin 
main 



domain//lmmunoglobulin domain//lmmunoglobulin do- 



SMI NT20062050//Reverse transcriptase (RNA-dependent DNA polymerase) 
SMINT20077960//Gelsolin repeatVZGelsoIin repeatV/Gelsolin repeatV/Gelsolin repeat. 

SMINT20083290//lmmunoglobulin domain//lmmunoglobulin domain/Zlmmunoglobulin domain//lmmunoglobulin do- 

main//lmmunoglobulln domain 

SMINT20086250//Glycine cleavage H-protein 

SMINT20086720//SCAN domain//KRAB box 

SMINT20088440//lmmunoglobulin domain 

SMINT20089600//PDZ domain (Also known as DHR or GLGF). 

SMINT20091190//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main//lmmunoglobulin domain 

SPLEN20006950//Reverse transcriptase (RNA-dependent DNA polymerase) 
SPLEN2001 1 350//Helper component proteinase 
SPLEN2001 51 00//DHHC zinc finger domain 
SPLEN20023540//RasGEF domain//EF hand//EF hand 

SPLEN20023850//RecF protein//SMC domain N terminal domain/ZTropomyosins 

SPLEN20024190//EGF-like domain//EB module//EGF-like domain/ZTrypsin Inhibitor like cysteine rich domainZZEGF- 
like domain//EGF-like domain//WAP-type (Whey Acidic Protein) 'four-disulfide coreV/EGF-like domain 
SPLEN20024930//lnosi polyphosphate phosphatase family, catalytic domain 
SPLEN200391 80//Fatty acid desaturase 

SPLEN20042200//Zinc finger, C3HC4 type (RLNG f inger)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type//Zinc finger, C2H2 type 

SPLEN20049840//Polyphosphate kinase//Myosin head (motor domain)//IQ calmodulin-binding motif//SH3 domain 
SPLEN20050090//Pyridoxamine 5'-phosphate oxidase//GRIP domain 
SPLEN20054500//Renal dipeptidase 

SPLEN20055600//K+ channel tetramerisation domain//BTB/POZ domain//Zinc finger, C2H2 type 
SPLEN20057900//Scorpion short toxins//EGF-l ike domain//EGF-like domain//Keratin, high sulfur B2 protein 
SPLEN20059270//Zinc finger, C3HC4 type (RING finger)//B-box zinc finger.//SPRY domain 
SPLEN20063250//Zinc finger, C2H2 type 

SPLEN20063890//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZZLeucine Rich RepeatZZLeucine 

Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZZLeucine Rich RepeatZZLeucine Rich Repeat 

SPLEN20071 820ZZDNA polymerase X family 

SPLEN20076470ZZTPR DomainZZTPR DomainZZTPR Domain 

SPLEN20080070ZZAIpha-L-fucosidase 

SPLEN20085910ZZDouble-stranded RNA binding motif 

SPLEN20090880ZZImmunoglobulin domain 

SPLEN20098030ZZZinc finger, C3HC4 type (RING finger)ZZB-box zinc finger. 
SPLEN20101950ZZSodiumZhydrogen exchanger family 
SPLEN20104150ZZRibosomal protein L36 
SPLEN20108000Z/short chain dehydrogenase 
SPLEN2011 01 80ZZTransposase 

SPLEN20118050ZZLeucine rich repeat N-terminal domainZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeucine Rich 
RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZ/Leucine Rich Repeat 
SPLEN20135030ZZPDZ domain (Also known as DHR or GLGF).ZZGATA zinc fingerZZLIM domain containing proteins 
SPLEN20139100ZZSodium and potassium ATPasesZZImmunoglobuiin domainZZImmunoglobulin domainZZImmunoglob- 
ulin domain/Zlmmunoglobulin domain 

SPLEN20139360ZZBacterial regulatory proteins, lacl familyZZSite-specific recombinases 
SPLEN20175920ZZUncharacterized protein family UPF0036 
SPLEN201 80980ZZGIutathione S-transferases. 

SPLEN201 82990Z/BTBZPOZ domain/ZKelch motifZZKelch motifZZKelch motifZZKelch motif 
SPLEN20183020ZZLaminin G domain 
SPLEN20191020ZZSrc homology domain 2 

SPLEN20193790ZZDynamin familyZZDynamin familyZZProteasome activator pa28 beta subunitZZPeroxidaseZZDynamin 
central regionZZDynamin GTPase effector domain SPLEN20195710Z/TPR DomainZZTPR DomainZZTPR Domain 
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SPLEN20197930//K-box region 

SPLEN201 98390//WD domain, G-beta repeat//Fibrillarin//WD domain, G-beta repeat//WD domain, G-beta repeat//WD 
domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
SPLEN20201830//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat/YLeucine Rich RepeatZ/Leucine 
Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich Repeat 
TEST1 1 0001 570//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2type//Zinc finger, C2H2type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//TRAF- 
type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Tran- 
scription factor S-il (TFIIS)//Zmc finger, C2H2 type 

TESTI20006160//Large-conductance mechanosensitive channel, MscL//CbiM 
TESTI20006830//PWWP domain 
TESTI20012080//Chitin synthase 
TESTI20016970//TPR Domain 

TESTI20030200//Double-stranded RNA binding mot if//Adenosine- deaminase (editase) domain//Adenosine-deami- 
nase (editase) domain 

TESTI20030440//Plant PEC family metallothionein/ZTropomyosins 
TESTI20031310//Serpins (serine protease inhibitors) 
TESTI20038240//Peptidase family M13 

TESTI20041630//Outer membrane efflux protein//lntermediate filament proteins 

TESTI20043910//IQ calmodulin-binding motif//IQ calmodulin-binding motif//IQ calmodulin-binding motif//IQ calmodu- 
lin-binding motif//IQ calmodulin-binding motif 

TESTI20045390//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain 
TESTI20046110//Extracellular link domain 

TESTI20046490//LIM domain containing proteins//Somatomedin B domain 
TESTI20046870//CRAUTRIO domainV/CRAL/TRIO domain.//MSP (Major sperm protein) domain 
TESTI20046890//PHD-finger 
TESTI20049060//lmmunoglobulin domain 
TESTI20049410//Proprotein convertase P-domain 

TESTI20050720//Coenzyme A transferase//Alpha-2-macroglobulin family N-terminal region//Coenzyme A transferase 

TESTI20051730//Glutamine amidotransferases class-ll//alpha/beta hydrolase fold 

TESTI20053070//WD domain, G-beta repeat//WD domain, G-beta repeat 

TESTI20053950//IQ calmodulin-binding motif 

TESTI20055880//Serum amyloid A protein 

TESTI20056030//Myosin tail 

TESTI20057430//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

TESTI20057590//Leucine rich repeat C-terminal domain//lmmunoglobulin domain 
TESTI20057840//SAP domain//Zinc knuckle//Zinc finger, C3HC4 type (RING finger) 
TESTI20057880//Zinc finger, C3HC4 type (RING finger) 

TESTI20058350//Polyomavirus coat protein//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G- 
beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
TESTI20058920//Tubulin/FtsZ family 

TESTI20059080//Thermophilic metalloprotease (M29)//Hyaluronidase 
TESTI20059480//Cyclic nucleotide-binding domain 

TESTI200598101/KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-l I (TFIIS)//Zinc fin- 
ger, C2H2 type//DM DNA binding domain//Zinc finger, C2H2 type//Zinc finger, C2H2 type//BolA-like protein//Zinc finger, 
C2H2 type//Coronavirus M matrix/glycoprotein//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20060350//bZIP transcription factor 

TESTI20060830/IZAP domain//Ferric uptake regulator family//Peptidase family M1//Piwi domain 
TESTI20061090//Kerat»n, high sulfur B2 protein 
TESTI20061200//Sugar (and other) transporter 

TESTI20064370//TPR Domain//TPR Domain//TPR Domain//TPR Domain//Synaptobrevln 

TESTI20064530//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
TESTI20064650//Myosin head (motor domain) 
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TESTI200656507/G-protein alpha subunit 

TESTI20066150//Picornavirus 2B proteirV/Glutamine amidotransferase class-l//Pancreatic hormone peptides 
TESTI20066330//Fibronectin type ill domain 
TESTI20066650//RasGEF domain 
5 TESTI20067480//KRAB box//Zinc finger, C2H2 type//TRAF-type zinc finger//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//FYVE zincfinger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, 
C2H2 type 

TESTI20068530//Zinc finger, C3HC4 type (RING finger)//PHD-finger 
10 TESTI20071130//ATP synthase Alpha chain, C terminal 

TESTI20071630//Glutamine synthetase//SCP-like extracellular protein 

TESTI20075240//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Putative zinc finger in N-recognin//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 
15 type 

TESTI20076570//Dual specificity phosphatase, catalytic domain 

TESTI20079220//KRAB box//Myb-like DNA-binding dornain//Myb-like DNA-binding domain//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type 
20 TESTI20079980//PDZ domain (Also known as DHR or GLGF) .//Domain found in Dishevelled, Egl-1 0, and Pleckstrin 
TESTI20081890//PDZ domain (Also known as DHR or GLGF). 
TESTI20084250//Oxysterol-binding protein 
TESTI20086840//von Willebrand factor type A domain 

TESTI20088840//Zinc finger, C3HC4type (RING finger)//PHD-finger//Thymidine kinases//E7 protein, Early protein// 
25 CONSTANTS family zinc finger//B-box zinc finger.//SPRY domain 
TESTI20092170//ENV polyprotein (coat polyprotein) 
TESTI20095200//7TM chemoreceptor 

TESTI20095770//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
30 TESTI20095880//Domain of unknown function DUF33//Penicillin am idase// Formate/nitrite transporter//Sodium:galac- 
toside symporter family TESTI20099350//GGL domain//Clusterin//Biopterin-dependent aromatic amino acid hydroxy- 
lase 

TESTI20100090//Lectin C-type domain 

TESTI20104090//TEA domain 
35 TESTI201 0591 07/Notch (DSL) domain// Amilo ride-sensitive sodium channel 

TESTI20106170//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 

G-beta repeat//WD domain, G-beta repeat 

TESTI20i068207/Protein kinase C terminal domain 

TESTI20108060//Ser/Thr protein phosphatase 
40 TESTI2011 2540//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich Repeat//Chorismate 

synthase//UvrB/uvrC motif 

TESTI20112860//Eukaryotic protein kinase domain 

TESTI20113940//Divalent cation transporter 

TESTI20114480//Zinc finger, C4 type (two domains)//Zinc finger, C2HCtype//SAM domain (Sterile alpha motif)//Sterile 
45 alpha motif (SAM)/Pointed domain 

TESTI20116050//UBX domain 

TESTI20120500//Kelch motif//Kelch motif 

TESTI20120900//DNA gyrase/topoisomerase IV, subunit A 

TESTI20121040//Protein phosphatase 2C//Protein phosphatase 2C 
so TESTI201220707/ELM2 domain//Myb-like DNA-binding domain 

TESTI20125280//lmmunoglobulin domain//lmmunoglobulin domain 

TESTI20125920//PCI domain 

TESTI20126280//recA bacterial DNA recombination proteins 
TESTI20131440//Carboxypeptidase activation peptide/ZZinc carboxypeptidase 
55 TESTI20132310//Ubiquitin carboxyl -terminal hydrolase family 2 

TESTI20134680//MYND finger//B-box zinc fingery/CONSTANS family zinc finger//Putative zinc finger in N-recognin 
TESTI20134970//Double-stranded RNA binding motif//Aldehyde oxidase and xanthine dehydrogenase, C terminus// 
Adenosine-deaminase (editase) domain 
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TESTI201 40970//!mmunoglobulin domain 
TESTI20145780//Src homology domain 2 

TESTI20148380//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain//PPR repeat// 
Neuraminidases//ATP synthase Alpha chain, Ctemriinal/ZTPR Domain//TPR Domain//TPR Domain//TPR Domaln//PPR 
5 repeat//TPR Domain 

TESTI20150420//RhoGAP domain 

TESTI20150920//Leucine Rich Repeat//Leucine Rich Repeat//Leucine Rich RepeatZ/Leucine Rich RepeatZ/Leucine 
Rich Repeat 

TESTI20153310//lntermediate filament tail domain 
10 TESTI201 627807/Divalent cation transporter 

TESTI20162980//Ubiquitin family//Retrovirai aspartyl protease//Retroviral aspartyl protease 

TESTI20164210//lsocitrate and isopropylmalate dehydrogenases//Ribosomal protein S27a//TILa domain//von Wille- 
brand factor type C domain 
TESTI20165990//Ribosomal protein L36 
15 TESTI201 662907/Zinc finger, C2H2 type//FAD binding domain//Phosphoenolpyruvate carboxykinase//Ribosomal pro- 
tein S11 

TESTI20166670//Zinc finger C-x8-C-x5-C-x3-H type (and similar). 
TESTI20169500//GGL domain 
TESTI20170280//Flagellar L-ring protein 
20 TESTI201 73960//KRAB box//Zinc finger, C2H2 type//2inc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

TESTI20179230//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
TESTI20182760//Amiloride-sensitive sodium channel 

25 TESTI201 83680//Gas vesicles protein GVPc repeated domain 

TESTI20184750//Laminin G domain//Thrombospondin N-terminal -like domains//Laminin G domain 
TESTI20184760//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Transcription 
factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20186110//Divalent cation transported/Translation initiation factor IF-3//Divalent cation transporter 

30 TESTI201 93080//Growth-Arrest-Specif ic Protein 2 Domain 
TESTI20194880//SAP domain 

TESTI20196690//Gfycine cleavage T-protein (aminomethyl transferase) 
TESTI20197030//Pancreatic hormone peptides 

TESTI201 97600//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
35 TESTI201 991 1 0//Disintegrin 

TESTI20205100//DNA gyrase/topoisomerase IV, subunit A 

TESTI20205250//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif//MORN motif 
TESTI20207170//Nucleosome assembly protein (NAP) 
TESTI20210570//CRALTTRIO domain. 

TESTI20212970//DEAD/DEAH box heiicase//Helicases conserved C-terminal domain 
TESTI20219110//Eukaryotic protein kinase domain 

TESTI20222030//Hemagglutinin//ATP synthase Alpha chain, C terminal//AMP-binding enzyme 
TESTI20222460//intermediate filament proteins 

TESTI20227380//DEAD/DEAH box helicase//Heiicases conserved C-terminal domain 
45 TESTI202281 20//RhoGAP domain 

TESTI20228740//Zinc finger, C2H2 type 

TESTI20244220//Cecropin famiiy//Fes/CIP4 homology domain//Hr1 repeat motif//SH3 domain 
TESTI20244430//Ank repeat//Ank repeat//Ank repeat//SAM domain (Sterile alpha motif) 

TESTI20244460//pKID domain//Adenylate kinase//Thymidylate kinase//ATPases associated with various cellular ac- 
50 tivities (AAA) 

TESTI20246480//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
TESTI20251610//Chitin synthase//von Willebrand factor type A domain 
TESTI20252690//Domain found in Dishevelled, Egl-10, and Pleckstrin 

TESTI20254030//LIM domain containing proteins//LIM domain containing proteins//Villin headpiece domain 
55 TESTI20254990//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20255460//ZAP domain//Piwi domain 

TESTI20257910//Class I Histocompatibility antigen, domains alpha 1 and 2//lmmunoglobulin domain 
TESTI20258720//Ank repeat//Ank repeat//Ank repeat 
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TESTI202591101/Zinc finger, C3HC4 type (RING finger)//PHD-finger 
TESTI20261040//DNA polymerase (viral) C-terminal domain 
TESTI20261160//PH domain 

TESTl20261680//Hsp20/alpha crystallin family//Granulins 
TESTI20262150//lon transport protein 
TESTI20262940//Phosphofructokinase 

TESTI20264530//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//C.elegans integral membrane protein Srb//WD domain, G-beta repeat// 
WD domain, G-beta repeat//SAM domain (Sterile alpha motif)//Sterile alpha motif (SAM)/Pointed domain 
TESTI20264910//Uteroglobin family 

TESTI20266050//Zinc finger, C3HC4 type (RING finger)//SPRY domain 

TESTI20274960//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20278280//PMP-22/EMP/MP20/Claudin family 

TESTI20282530//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TESTI20284260//Histone-like transcription factor (CBF/NF-Y) and archaeal histone//Core histone H2A/H2B/H3/H4 
TESTI20285230//Adenosine-deaminase (editase) domain 
THYMU1 0004280//NHL repeat//NHL repeat 
THYMU20006020//lsocitrate and isopropylmalate dehydrogenases 
THYMU20009500//TPR Domain 

THYMU20013250//LIM domain containing proteins//RI01/ZK632.3/MJ0444 family//Eukaryotic protein kinase domain 
THYMU20018250//TPR Domain 

THYMU20019260//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
THYMU20021090//SAM domain (Sterile alpha motif )//Sterile alpha motif (SAM)/Pointed domain 
THYMU20028150//!mmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main 

THYMU20028410//BRCA1 C Terminus (BRCT) domain//BRCA1 C Terminus (BRCT) domain 
THYMU20031330//4Fe-4S iron sulfur cluster binding proteins, NifH/frxC family 

THYMU20032820//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

THYMU20039320//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat 

THYMU20046350//Cytochrome C and Quinol oxidase polypeptide I 
THYMU20049060//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 

THYMU20052830//lmmunoglobulin domairV/lmmunoglobulin domain//lmmunogiobulin domain//lmmunoglobulin do- 
main 

THYMU20055450//Zona pellucida-like domain 
THYMU20055460//Putative esterase 

THYMU20055760//Na+/K+ ATPase C-terminus//Phospholipase A2 
THYMU20062770//Zona pellucida-like domain 

THYMU20063650//Ribulose-phosphate 3 epimerase family//! ndole-3-glycerol phosphate synthases 
THYMU20066660//DEAD/DEAH box helicase 

THYMU20070250//Transketolase//Dehydrogenase E1 component//Transketolase 

THYMU20071120//KRABbox//Zincfinger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

THYMU20077250//HMG (high mobility group) box 
THYMU20081110//LIM domain containing proteins 
THYMU20083390//11 S plant seed storage protein 

THYMU20090230//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
THYMU20095920//lron hydrogenase small subunit 
THYMU20097920//PH domain//FERM domain (Band 4.1 family) 

THYMU20098350//bZIP transcription factor//bZIP transcription factor//Tubulin/FtsZ family//lntermediate filament pro- 
teins 

THYMU20099060//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
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main//lmmunoglobulin domain 

THYMU20100940//Protein of unknown function DUF132 
THYMU20104480//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
THYMU201 06990//PH domain 

THYMU20110720//Aminotransferases class-Ill pyridoxal-phosphate 
THYMU20112590//Acyl-CoA dehydrogenase//Adaptin N terminal region 

THYMU20120240//Ubiquitin carboxyl-termina! hydrolases family 2//Ubiquitin carboxyl-terminal hydrolase family 2 
THYMU20120730//Aldehyde dehydrogenase family 

THYMU20121040//bZIP transcription factor//EF-1 guanine nucleotide exchange domain 
THYMU201 391 60//Uncharacterized protein family UPF0031 
THYMU20143230//EGF-like domain//Extracellular linkdomain//Fasciclin domain 
THYMU20145990//SH3 domain 

THYMU20153210//7 transmembrane receptor (Secretin family) 
THYMU201 70230//Glycine cleavage T-protein (aminomethyl transferase) 

THYMU201 7601 0//WD domain, G-beta repeat//PQQ enzyme repeat//WD domain, G-beta repeat//WD domain, G-beta 
repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta re- 
peat//WD domain, G-beta repeat 

THYMU20178440//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobulin do- 
main 

THYMU20184550//HSF-type DNA-binding domain//bZIP transcription factor 
THYMU20191970//Cadherin domain//Cadherin domain//Cadherin domain//Cadherin domain 
TKIDN1 0000620//Thioredoxin 

TKIDN1 0001 920//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
TRACH2001 1 01 0//5' -nucleotidase 

TRACH20021380//Copper/zinc superoxide dismutase (SODC)//Adenylate and Guanylate cyclase catalytic domain// 
Adenylate and Guanylate cyclase catalytic domain 

TRACH20029880//MORN motif//MORN motif//Penicillin amidase//Bacterial regulatory proteins, lad family//Vacuolar 
sorting protein 9 (VPS9) domain 

TRACH20040390//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio-family 
RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio-family RNA binding domains (aka PUM-HD, 
Pumilio homology domain)//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio- 
family RNA binding domains (aka PUM-HD, Pumilio homology domain)//Pumilio-family RNA binding domains (aka 
PUM-HD, Pumilio homology domain)//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain) 
//Pumilio-family RNA binding domains (aka PUM-HD, Pumilio homology domain) 
TRACH20043360//Kinesin motor domain//Caspase recruitment domain//Ribosomal protein L35 
TRACH20058000//Fibronectin type III domain 
TRACH20090060//C2 domain 
TRACH20091070//Aldehyde dehydrogenase family 
TRACH20093400//Adaptin N terminal region 
TRACH20098510//Ribosomal L29 protein 
TRACH20104510//Uncharacterized protein family UPF0005 

TRACH20108240//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 
RBD, or RNP domain) 

TRACH20113020//AIR synthase related protein 

TRACH20122980//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain 
TRACH20131230//PH domain/ZOxysterol-binding protein 
TRACH20139280//PX domain 

TRACH201 4371 ©//Leucine Rich RepeatZ/Leucine Rich Repeat//Leucine Rich Repeat 
TRACH20149500//Zinc finger//Plexin repeatZ/Cysteine rich repeat//lntermediate filament proteins 
TRACH20149740//Sodium:dicarboxylate symporter family 

TRACH201 63470//Putative integral membrane protein DUF46//Sugar (and other) transporter//Sodium:galactoside 
symporter family 

TRACH20164100//Retroviral aspartyl protease 
TRACH20164810//D-isomer specific 2-hydroxyacid dehydrogenases 
TRACH201 67090//Chitinases, family 2 

TRACH20170860//lmmunoglobulin domain//lmmunoglobulin domain//lmmunoglobuiin domain//lmmunoglobulin do- 
main 
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TRACH201 88350//T ropomyosins 
TRACH20190460//Lipase (class 3) 

UTERU20000950//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat 
5 UTERU2001 6580//Zinc finger present in dystrophin, CBP/p300//Myb-like DNA-binding domain 

UTERU20026620//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger, C2H2 type 

UTERU20041630//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Transcription factor S-H (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
10 UTERU20083020//Domain of unknown function DUF71 

UTERU20086530//Lipocalin / cytosolic fatty-acid binding protein family 
UTERU20087070//Sushi domain (SCR repeat)//Trypsin 

UTERU20089390//TPR Domain//TPR Domain//TPR Domain//TPR Domain//TPR Domain/ZTPR Domain//TPR Do- 
main//TPR Domain 
is UTERU20089620//Fringe-like 

UTERU20099040//Cation efflux family 

UTERU20099510//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//PHD-finger//Zinc finger, C2H2 type 
20 UTERU201 0431 0//RN A polymerases K / 1 4 to 1 8 kDa subunit 
UTERU201 2 1 1 40//Rhodanese-like domain 

UTERU20122520//FERM domain (Band 4.1 family)//FERM domain (Band 4.1 family) 
UTERU20127030//Xylose isomerase//ApbE family 

UTERU20127150//Translation initiation factor IF-3//Divalent cation transporter 
25 UTERU20128560//Domain of unknown function DUF28 
UTERU20132620//HMG14 and HMG17 

UTERU20139760//Mitochondrial carrier proteins//Mitochondrial carrier proteins 
UTERU20168960//PH domain//Methanol dehydrogenase beta subunit 
UTERU201 81 270//Zinc knuckle 
30 UTERU201 85220//Bromodomain 

EXAMPLE 6 

Functional categorization based on the full-length nucleotide sequences 

35 

[0224] The functional prediction and categorization of the proteins encoded by the clones were carried out based 
on the result of homology search of the databases of GenBank, Swiss-Prot, UniGene and nr (see the Homology Search 
Result Data) for the full-length nucleotide sequences and the result of domain search of the amino acid sequences 
deduced from the full-length nucleotide sequences (see Example 5). 

40 [0225] The clone predicted to belong to the category of secretory protein/membrane protein means a clone having 
hit data with some annotation, such as growth factor, cytokine, hormone, signal, transmembrane, membrane, extra- 
cellular matrix, receptor, G-protein coupled receptor, ionic channel, voltage-gated channel, calcium channel, cell ad- 
hesion, collagen, connective tissue, etc., suggesting that it is a secretory or membrane protein, or means a clone in 
which the presence of nucleotide sequence encoding a signal sequence or transmembrane domain was suggested 

45 by the results of PSORT and SOSUI analyses for deduced ORF. 

[0226] The clone predicted to belong to the category of glycoprotein-related protein means a clone having hit data 
with some annotation, such as glycoprotein, suggesting that the clone encodes a glycoprotein-related protein. 
[0227] The clone predicted to belong to the category of signal transduction-related protein means a clone having hit 
data with some annotation, such as serine/threonine-protein kinase, tyrosine-protein kinase, SH3 domain, SH2 domain, 

50 etc., suggesting that the clone encodes a signal transduction-related protein. 

[0228] The clone predicted to belong to the category of transcription-related proteinemeans a clone having hit data 
with some annotation, such as transcription regulation, zinc finger, homeobox, etc., suggesting that the clone encodes 
a transcription-related protein. 

[0229] The clone predicted to belong to the category of disease-related protein means a clone having hit data with 
55 some annotation, such as disease mutation, syndrome, etc., suggesting that the clone encodes a disease-related 
protein, or means a clone whose full-length nucleotide sequence has hit data for Swiss-Prot, GenBank, or UniGene, 
where the hit data corresponds to genes or proteins which have been deposited in the Online Mendelian Inheritance 
in Man (OMIM) (http://www.ncbi. nlm.nih.gov/Omim/), which is the human gene and disease database. 
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[0230] The clone predicted to belong to the category of enzyme and/or metabolism-related protein means a clone 
having hit data with some annotation, such as metabolism, oxidoreductase, E. C. No. (Enzyme commission number), 
etc., suggesting that the clone encodes an enzyme and/or metabolism- related protein. 

[0231] The clone predicted to belong to the category of cell division and/or cell proliferation-related protein means 
5 a clone having hit data with some annotation, such as cell division, cell cycle, mitosis, chromosomal protein, cell growth, 

apoptosis, etc., suggesting that the clone encodes a cell division and/or cell proliferation-related protein. 

[0232] The clone predicted to belong to the category of cytoskeleton-related protein means a clone having hit data 

with some annotation, such as structural protein, cytoskeleton, actin-binding, microtubles, etc., suggesting that the 

clone encodes a cytoskeleton-related protein. 
10 [0233] The clone which is predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein 

means a clone having hit data with some annotation, such as nuclear protein, RNA splicing, RNA processing, RNA 

helicase, polyadenylation, etc., suggesting that the clone encodes a nuclear protein and/or RNA synthesis-related 

protein. 

[0234] The clone predicted to belong to the category of protein synthesis and/or transport- related protein means a 
15 clone having hit data with some annotation, such as translation regulation, protein biosynthesis, amino-acid biosyn- 
thesis, ribosomal protein, protein transport, signal recognition particle, etc., suggesting that the clone encodes a protein 
synthesis and/or transport- related protein. 

[0235] The clone predicted to belong to the category of cellular defense-related protein means a clone having hit 
data with some annotation, such as heat shock, DNA repair, DNA damage, etc., suggesting that the clone encodes a 
20 cellular defense-related protein. 

[0236] The clone predicted to belong to the category of development and/or differentiation-related proteins means 
a clone having hit data with some annotation, such as developmental protein, etc., suggesting that the clone encodes 
a development and/or differentiation- related protein. 

[0237] The clone predicted to belong to the category of DNA-binding and/or RNA-binding protein means a clone 
25 having hit data with some annotation, such as DNA-binding, RNA-binding, etc. 

[0238] The clone predicted to belong to the category of ATP-binding and/or GTP-binding protein means a clone 
having hit data with some annotation, such as ATP-binding, GTP-binding, etc. 

[0239] In this functional categorization, when a single clone corresponded to multiple categories of those shown 
above, the clone was assigned to the multiple categories. However, the function of a protein is not restricted to the 
30 functional category in this classification, and there is the possibility that other functions are newly assigned to the protein. 
[0240] The clones predicted to belong to the category of secretory protein and/or membrane protein are the following 
516 clones. 

ADRGL20020290, ADRGL20021910, ADRGL20036380, ADRGL20036840, ADRGL20059610, ADRGL20063770, 
ADRGL20066770, ASTRO200 10010, ASTRO20020240, ASTRO20045840, ASTRO20053430, ASTRO20055530, 

35 ASTRO20055570, ASTRO20055930, ASTRO20088950, ASTRO20091180, BNGH420021680, BNGH420023870, 
BNGH420046790, BNGH420052350, BNGH420059680, BNGH4200 75940, BNGH420077980, BRACE1 0000510, 
BRACE20051930, BRACE20052530, BRACE20054080, BRACE20066360, BRACE20068710, BRACE20069000, 
BRACE20069110, BRACE20 194670, BRACE20204670, BRACE2021 6950, BRAMY1 0001 730, BRAMY20003880, 
BRAMY20013670, BRAMY20024790, BRAMY20027390, BRAMY20028530, BRAMY20035380, BRAMY20044920, 

40 BRAMY20045210, BRAMY20047560, BRAMY20050940, BRAMY20053910, BRAMY20055760, BRAMY20072440, 
BRAMY20083820, BRAMY20089770, BRAMY20091230, BRAMY20094890, BRAMY20096930, BRAMY2011841 0, 
BR AMY201 23400, BRAMY20 125550, B RAM Y201 27310, BRAMY20 127760, BRAMY201 35720, BRAMY20 137360, 
BRAMY201 39440, BRAMY201 52510, BR AM Y20 194680, BRAMY20204270, BRAMY20225320, BRAMY20237190, 
BRAMY20245140, B RAM Y2025 1750, BRAMY20285650, BRAWH20020470, BRAWH2002191 0, BRAWH20026010, 

45 BRAWH20030000, BRAWH20039640, BRAWH20055330, BRAWH20078620, BRAWH20093070, BRAWH20 185270, 
BRCAN10000760, BRCAN1 0001 680, BRCAN20001480, BRCAN20004180, BRCAN20005230, BRCOC20000470, 
BRCOC20003600, BRHIP1 0000720, BRHIP1 0001040, BRHIP20000210, BRSSN20001970, BRSSN20074640, 
BRSSN20091190, CD34C20001 750, CTONG200 17490, CTONG20036990, CTONG20041260, CTONG20044870, 
CTONG20045500, CTONG20049480, CTONG20051450, CTONG20055850, CTONG20056150, CTONG20059130, 

50 CTONG20060040, CTONG20063770, CTONG20065680, CTONG20068360, CTONG20069320, CTONG20071 680, 
CTONG20076810, CTONG20078340, CTONG20079590, CTONG20083980, CTONG20084020, CTONG20085210, 
CTONG201 67750, CTONG201 68240, CTONG201 79890, CTONG20 183830, CTONG201 84830, DFNES2001 8000, 
DFNES20029660, DFNES20057660, DFNES20072990, DFNES20080880, FCBBF20018680, FCBBF20029280, 
FCBBF20032930, FCBBF20036360, FCBBF20054390, FCBBF30004340, FCBBF30022680, FCBBF30029250, 

55 FCBBF30042610, FCBBF30062490, FCBBF30075970, FCBBF30078600, FCBBF30091520, FCBBF30095410, 
FCBBF301 05440, FCBBF301 18670, FCBBF301 32660, FCBBF301 35890, FCBBF301 45670, FCBBF301 64510, 
FCBBF301 69870, FCBBF30171230, FCBBF301 72330, FCBBF301 77290, FCBBF30 179740, FCBBF30 195690, 
FCBBF301 97840, FCBBF302 12210, FCBBF30223110, FCBBF30223210, FCBBF30225930, FCBBF3023061 0, 
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FCBBF30260480, FCBBF30266510, FCBBF30287940, FCBBF50000610, FCBBF50004950, FEBRA2000782.0, 
FEBRA20018670, FEBRA20031280, FEBRA20031 810, FEBRA20038220, FEBRA20039260, FEBRA20040230, 
FEBRA20040560, FEBRA20046280, FEBRA20080860, FEBRA20084750, FEBRA20088810, FEBRA201 15930, 
FEBRA201 16650, FEBRA20121950, FEBRA20141980, FEBRA201 77800, FEBRA201 82030, FEBRA20191720, 

5 HCHON1 0001 660, HCHON20015050, HEART1 0001 490, HEART20031680, HHDPC10001140, HHDPC20051850, 
HHDPC20082790, HHDPC200881 60, HLUNG20015070, HLUNG20015180, HLUNG20020850, HLUNG20029490, 
HLUNG20032460, HLUNG20033350, HLUNG20034970, HLUNG20037160, HLUNG20041540, HLUNG20042730, 
HLUNG20050760, HLUNG20052300, HLUNG20060670, HLUNG20065990, HLUNG20074330, HLUNG20081390, 
HLUNG20088750, HLUNG20092530, KIDNE20016360, KIDNE20083150, KIDNE20084030, KIDNE20084040, 

10 KIDNE20084800, K1DNE20086490, KIDNE20086660, KIDNE20094670, KIDNE20134130, KIDNE201 42900, 
KIDNE201 43200, KIDNE201 48080, KIDNE201 60960, KIDNE201 63710, KIDNE20169180, KIDNE201 82540, 
KIDNE201 861 70, KIDNE201 88630, KIDNE201 89960, LIVER20007750, LIVER2001 051 0, LIVER2001 0990, 
LIVER20026440, LIVER20030650, LIVER20038000, MESAN20007110, MESAN20008150, MESAN20021220, 
MESAN20027900, MESAN20058110, MESAN20059570, MESAN20060430, MESAN20067430, MESAN20084150, 

15 MESAN20095220, NT2NE20018740, NT2NE20021860, NT2NE20039210, NT2NE20053230, NT2NE20059210, 
NT2NE20064780, NT2NE20069580, NT2NE20080770, NT2NE20082130, NT2NE20092950, NT2NE20140130, 
NT2NE201 45250, NT2NE20 146510, NT2NE201 52620, NT2NE201 67660, NT2NE20181800, NT2R 12001 6240, 
NT2RI20021200, NT2RI20033920, NT2RP70003110, NT2RP 70027790, NT2RP70031 070, NT2RP70031480, 
NT2RP70056690, NT2RP70087140, NTONG20034540, NTONG20053630, OCBBF20000740, OCBBF20012520, 

20 OCBBF201 09780, OCBBF201 10210, OCBBF201 10730, OCBBF201 12280, OCBBF201 18720, OCBBF201 20010, 
OCBBF201 23200, OCBBF20 155030, OCBBF201 55900, OCBBF201 65910, OCBBF201 70350, OCBBF201 76650, 
OCBBF201 85630, OCBBF20191950, PANCR 10000860, PEBLM20001800, PLACES 000 1290, PLACE60004260, 
PLACE60006300, PLACE60053280, PLACE60055590, PLACE60056910, PLACE60057860, PLACE60061370, 
PLACE60064740, PLACE60070500, PLACE60087680, PLACE 60 104630, PLACE601 07010, PLACE601 13340, 

25 PLACE601 38840, P LAC E60 154450, PLACE60 184870, PROST1 0001100, PROST20011160, PROST20014150, 
PROST20035830, PROST20045700, PROST20050390, PROST20065100, PROST20073280, PROST20082430, 
PROST20084680, PROST20084720, PROST20099090, PROST20 105450, PROST20206060, PROST20 108850, 
PROST20110120, PROST20114100, PROST20 146590, PROST20 152510, PROST201 68600, PUAEN 10000870, 
SKMUS20006790, SKMUS20020770, SKMUS20073150, SKMUS20091900, SKNMC20006350, SKNSH20094350, 

30 SMINT20006090, SMINT20008110, SMINT20024140, SMINT20028840, SMINT20045470, SMINT20077960, 
SMINT20081330, SMINT20086250, SMINT20088440, SMINT20088690, SMINT20092160, SPLEN20015100, 
SPLEN20017610, SPLEN20017810, SPLEN20024190, SPLEN20024620, SPLEN20054500, SPLEN20058180, 
SPLEN20063890, SPLEN20073880, SPLEN20080070, SPLEN20090880, SPLEN20101 950, SPLEN201 04690, 
SPLEN20105100, SPLEN201 08000, SPLEN20110180, SPLEN201 10860, SPLEN201 18050, SPLEN20121790, 

35 SPLEN201 25230, SPLEN201 36700, SPLEN201 38600, SPLEN20139100, SPLEN201 75920, SPLEN201 77400, 
SPLEN201 82850, SPLEN201 83020, SPLEN201 83950, SPLEN201 90080, SPLEN201 90770, SPLEN201 93230, 
SPLEN201 93490, SPLEN201 93790, SPLEN20201830, SPLEN20204670, TESOP1 0000350, TESTI1 00001 90, 
TESTI200061 60, TESTI20029100, TESTI2003131 0, TESTI20032770, TESTI20038240, TESTI20043130, 
TESTI20043220, TESTI20045390, TESTI20046540, TESTI20046870, TESTI20047370, TESTI20050400, 

40 TESTI20051200, TESTI20051730, TESTI20053260, TESTI20053780, TESTI20057200, TESTI20057590, 
TESTI20059080, TESTI20061 200, TESTI200621 20, TESTI20063330, TESTI2006341 0, TESTI20063600, 
TESTI20066330, TESTI20068530, TESTI 20070400, TESTI20070740, TESTI20073460, TESTI20086840, 
TESTI20095200, TESTI20095440, TESTI20095880, TESTI201 00090, TESTI201 02390, TESTI201 0591 0, 
TESTI201 13940, TESTI20116120, TESTI20121040, TESTI20121710, TESTI20131440, TESTI20 142540, 

45 TESTI201 49880, TESTI20151 800, TESTI201 62780, TESTI201 70170, TESTI201 73050, TESTI20 182760, 
TESTI201 83680, TESTI201 84750, TESTI201 861 1 0, TESTI201 98540, TESTI201 991 1 0, TESTI20202830, 
TESTI20204260, TESTI2021 0030, TESTI2021 4630, TESTI2021 911 0, TESTI20244730, TESTI20245600, 
TESTI20245860, TESTI2024641 0, TESTI20251 61 0, TESTI2025791 0, TESTI20260640, TESTI20261 040, 
TESTI202621 50, TESTI20262940, TESTI2026491 0, TESTI20271 790, TESTI20278280, TESTI20282420, 

50 TESTI20282900, TESTI20286590, THYMU20007020, THYMU20012020, THYMU200 17270, THYMU20020800, 
THYMU20025480, THYMU20028150, THYMU20030690, THYMU20034790, THYMU20046350, THYMU20046770, 
THYMU20050010, THYMU20052830, THYMU20054800, THYMU20055740, THYMU20055760, THYMU20062770, 
THYMU20078240, THYMU20079690, THYMU20083390, THYMU20087270, THYMU201 00940, THYMU201 15380, 
THYMU201 37050, THYMU201 37570, THYMU 20 143230, THYMU20150190, THYMU201 53210, THYMU201 54790, 

55 THYMU201 63600, THYMU20171580, THYMU201 78440, THY M 1)201 85470, TRACH20011010, TRACH20011540, 
TRACH20021380, TRACH20073990, TRACH20081270, TRACH20090060, TRACH201 49720, TRACH20 149740, 
TRACH201 59390, TRACH20 163470, TRACH20 165330, TRACH20 167090, TRACH201 73680, TRACH201 90460, 
UMVEN10001380, UTERU20035770, UTERU20040150, UTERU20045200, UTERU20064120, UTERU20086530, 
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UTERU20087070, UTERU20087850, UTERU20 089300, UTERU20089620, UTERU20095100, UTERU20099040, 
UTERU201 03200, UTERU201 25810, UTERU201 27030, UTERU20127150, UTERU20 139760, UTERU201 88840 
[0241] The clones predicted to belong to the category of glycoprotein-related protein are the following 121 clones. 
ADRGL20020290, ADRGL20036840, ADRGL20059610, AD RGL200 66770, ASTRO20055570, BNGH420046790, 

5 BNGH420077980, BRACE20051930, BRACE20069000, BRACE20204670, BRACE202 16950, BRAMY20013670, 
BRAMY20089770, BRAMY20251210, BRAWH20039640, BRCAN1 0000760, BRCAN20005230, BRCOC20003600, 
CD34C20001750, CTONG20017490, CTONG20036990, CTONG20045500, CTONG20059130, CTONG20079590, 
CTONG20085210, CTONG20 184830, DFNES20018000, DFNES20080880, FCBBF30004340, FCBBF30029250, 
FCBBF30062490, FCBBF30091520, FCBBF301 64510, FCBBF301 71230, FCBBF301 95690, FCBBF30223210, 

10 FEBRA20038220, HCHON20015050, HLUNG20015070, HLUNG20032460, HLUNG20037160, HLUNG20041540, 
KIDNE20142900, KIDNE20169180, KIDNE20186170, KIDNE201 89960, MESAN20021220, MESAN20058110, 
NT2NE20064780, NT2NE20140130, NT2NE20 155650, NT2RP70056690, NTONG20053630, OCBBF20000740, 
OCBBF20012520, OCBBF20110210, OCBBF201 20010, OCBBF201 65900, OCBBF201 65910, OCBBF20191950, 
PEBLM20001800, PLACE60 0004260, PLACE60087680, PUVCE60113340, PLACE601 84870, PROST20033240, 

15 PROST20099090, PROST201 08850, PROST20 146590, SKMUS20073150, SKNMC20006350, SMINT20028840, 
SMINT20056230, SMINT20083290, SMINT20091190, SPLEN20024620, SPLEN20063890, SPLEN20080070, 
SPLEN20090880, SPLEN201 18050, SPLEN20139100, SPLEN201 83020, SPLEN20201830, TESTI1 0000190, 
TESTI20031310, TESTI20043990, TESTI20045390, TESTI20051200, TESTI20057590, TESTI20059080, 
TESTI20066330, TESTI20086840, TESTI201 00090, TESTI201 0591 0, TESTI201 54370, TESTI201 6421 0, 

20 TESTI201 82760, TESTI201 84750, TESTI20199110, TESTI20219110, TESTI20220230, TESTI20245600, 
TESTI20251610, TESTI20257910, TESTI20286590, THYMU20024500, THYMU20028150, THYMU20052830, 
THYMU20062770, THYMU20099060, THYMU201 70080, THY MU201 78440, TRACH20011010, TRACH20011540, 
TRACH20021380, TRACH201 49740, TRACH201 70860, TRACH201 90460, UTERU20086530, UTERU20087070, 
UTERU201 27030 

25 [0242] The clones predicted to belong to the category of signal transduction-related protein are the following 88 
clones. 

ASTRO20050810, ASTRO20052420, ASTRO20085080, ASTRO20090680, BNGH420008150, BNGH4200 15760, 
BNGH420035290, BNGH420086030, BRAMY20035830, BRAMY20043630, BRAMY201 18490, BRAMY20206340, 
BRAMY20244490, BRAMY20251210, BRAMY20263000, BRAWH20093040, BRAWH201 90550, CTONG20004520, 

30 CTONG20029030, CTONG20030280, CTONG20063930, CTONG20070720, CTONG20 189000, FCBBF30001100, 
FCBBF30076310, FCBBF301 00080, FCBBF301 43550, FCBBF30153170, FCBBF301 75350, FCBBF30250980, 
FEBRA20090160, FEBRA201 73330, HCHON20000870, HLUNG20011260, HLUNG20084790, KIDNE20089870, 
KIDNE201 60360, LIVER20011640, MESAN20021130, MESAN20027240, MESAN20065990, NT2NE2001 8890, 
NT2NE20042550, NT2RP70075800, NTONG20043080, NTONG20 048440, PLACE60071 800, PROST20033240, 

35 PROST20052850, PROST20065790, PROST20 075280, SKNSH20052400, SKNSH20057920, SMINT20006020, 
SMINT20035050, SPLEN20023540, SPLEN20039180, SPLEN20048800, SPLEN20049840, SPLEN20054160, 
SPLEN20085910, SPLEN20191020, SPLEN201 98390, TESTI20046490, TESTI20049060, TESTI20053070, 
TESTI20066650, TESTI20081 890, TESTI 20095770, TESTI201 06820, TESTI201 1 2860, TESTI201 45780, 
TESTI201 50420, TESTI201 68880, TESTI20205250, TESTI202281 20, TESTI20244220, TESTI20244460, 

40 TESTI20251740, TESTI20261160, TESTI20264530, THYMU20013250, THYMU20039320, THYMU201 06990, 
THYMU201 45990, THYMU201 70080, THYMU201 76010, TRACH201 88350 

[0243] The clones predicted to belong to the category of transcription-related protein are the following 143 clones. 
ASTRO20038400, ASTRO20075150, BNGH420070370, BNGH420074600, BNGH420087430, BRACE20003310, 
BRACE20061620, BRAMY20001510, BRAMY20040580, BRAMY20076100, BRAMY20111780, BRAWH20040680, 

45 BRAWH20050740, BRAWH20080580, BRAWH 20082920, BRAWH20095900, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG20074000, CTONG20084660, CTONG20 186370, 
CTONG20 186520, DFNES20028170, DFNES20046840, DFNES20073320, FCBBF30003610, FCBBF3001 9140, 
FCBBF30021900, FCBBF30093170, FCBBF301 14850, FCBBF30 129010, FCBBF301 36230, FCBBF301 43550, 
FCBBF30220050, FCBBF30228940, FCBBF30263080, FCBBF30285930, FCBBF50003530, FEBRA20026820, 

50 FEBRA20027070, FEBRA20046510, FEBRA20057010, FEBRA20063720, FEBRA201 70240, HCHON1 0000150, 
HCHON20002650, HEART200 19310, HLUNG20014590, HLUNG20028110, HLUNG20063700, KIDNE201 40870, 
LIVER20006260, MESAN2001 6270, MESAN20038520, NT2NE20038870, NT2NE20053950, NT2NE20060750, 
NT2NE20061030, NT2NE20079670, NT2NE20082600, NT2RP70001120, NT2RP70029780, NT2RP7004641 0, 
NT2RP70057500, NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF201 20950, OCBBF20121910, 

55 OCBBF201 56450, OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF20 177540, PEBLM20003260, 
PLACE60052940, PLACE60066970, PLACE601 22970, PLACE60 150510, PLACE601 77880, PROST20007170, 
PROST20024250, PROST20035170, PROST20 127450, PROST20151370, PROST201 55370, PUAEN 10000650, 
PUAEN20003120, SMINT20011950, SMINT20026200, SMINT20030740, SMINT20039050, SMINT20044140, 
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SMINT20086720, SPLEN20042200, SPLEN20043680, SPLEN20055600, SPLEN20059270, SPLEN20063250, 
SPLEN20098030, SPLEN201 97930, TESTM 0001 570, TESTI20057430, TESTI20057840, TESTI2005981 0, 
TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, TESTI201 04090, 
TESTI201 22070, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, TESTI201 94880, 

5 TESTI201 97600, TESTI20228740, TESTI20254030, TESTI20254990, TESTI20266050, TESTI20274960, 
TESTI20282530, THYMU 10004280, THYMU20019260, THYMU20032820, THYMU20071120, THYMU20077250, 
TKIDN10001920, UTERU200 16580, UTERU20026620, UTERU20041630, UTERU20094830, UTERU20099510, 
UTERU20101150, UTERU201 69020, UTERU20177150, UTERU201 85220, UTERU201 88670 
[0244] The clones predicted to belong to the category of disease-related protein are the following 331 clones. 

10 ADRGL20020290, ADRGL20021910, ADRGL20026790, ADRGL20036840, ADRGL20059610, ADRGL20066770, 
ASTRO20038400, ASTRO20052420, ASTRO20 055570, ASTRO20075150, ASTRO20088950, BNGH420008150, 
BNGH420086030, BRACE1 0000510, BRACE2000331 0, BRACE20069000, BRACE20097540, BRACE201 94670, 
BRACE20196180, BRACE20204670, BRACE20216950, BRAMY20003540, BRAMY20005080, BRAMY20035830, 
BRAMY20040580, BRAMY20043630, BRAMY20044920, BRAMY20051 820, BRAMY20056620, BRAMY20089770, 

15 BRAMY20111780, BRAMY201 52510, BRAMY201 90550, BRAMY20221600, BRAMY20227860, BRAMY20274510, 
BRAWH20082920, BRAWH20093040, BRAWH20095900, BRAWH201 90530, BRAWH20191980, BRCAN1 0000760, 
BRCAN1 0001 050, BRCAN20005230, BRSSN20066440, CTONG20004520, CTONG20029030, CTONG20042640, 
CTONG20045500, CTONG20052780, CTONG20053990, CTONG20070780, CTONG20070910, CTONG20072930, 
CTONG20083980, CTONG20084660, CTONG201 65750, CTONG201 69040, CTONG201 83430, CTONG20 183830, 

20 CTONG201 86290, CTONG201 89000, D FN ES200 16470, DFNES20025500, DFNES20046840, DFNES20055400, 
DFNES20080880, FCBBF1 0000230, FCBBF20035490, FCBBF20066340, FCBBF30002270, FCBBF30002280, 
FCBBF30019140, FCBBF30053300, FCBBF30071500, FCBBF30072440, FCBBF30076310, FCBBF30080730, 
FCBBF301 00080, FCBBF301 15920, FCBBF3011 8670, FCBBF301 29010, FCBBF30 132050, FCBBF30 136230, 
FCBBF30153170, FCBBF301 64510, FCBBF30 166220, FCBBF30171230, FCBBF30 175350, FCBBF301 94550, 

25 FCBBF30220050, FCBBF30223210, FCBBF30259050, FCBBF30263080, FCBBF30275590, FCBBF50001650, 
FEBRA20027070, FEBRA20045380, FEBRA20046200, FEBRA20046510, FEBRA20057010, FEBRA20063720, 
FEBRA20078800, FEBRA20087550, FEBRA20088810, FEBRA20090160, FEBRA20092760, FEBRA20151750, 
FEBRA201 70240, FEBRA201 73330, FEBRA20191720, HCHON10000150, HCHON20015050, HEART20C09590, 
HEART20022200, HEART20063100, HHDPC20081230, HLUNG20008460, HLUNG20014590, HLUNG20032460, 

30 HLUNG20063700, HLUNG20065990, HLUNG20069350, HLUNG20081530, HLUNG20082350, HLUNG20083330, 
HLUNG20085210, KIDNE20081170, KIDNE20084040, KIDNE20088240, KIDNE20089870, KIDNE201 33460, 
KIDNE20134890, KIDNE20141700, KIDNE201 42900, KIDNE20150730, KIDNE20 152440, KIDNE201 60360, 
KIDNE201 65390, KIDNE20169180, KIDNE201 73430, KIDNE20189960, LIVER20026440, MESAN20006200, 
MESAN20021130, MESAN20033220, MESAN20056890, MESAN20057240, MESAN20065990, MESAN20067430, 

35 MESAN20069530, NESOP20004520, NT2NE20018890, NT2NE20026200, NT2NE20037050, NT2NE20053950, 
NT2NE20061030, NT2NE20111190, NT2NE201 17580, NT2NE201 19980, NT2NE20140130, NT2NE20141040, 
NT2RI20093010, NT2RP70003110, NT2RP70046410, NT2RP70075300, NTONG200321 00, NTONG20034540, 
OCBBF20000740, OCBBF20012520, OCBBF20111600, OCBBF2012001 0, OCBBF201 56450, OCBBF201 57970, 
OCBBF20191950, PEBLM20001800, PEBLM20003260, PLACE60004260, PLACE60012620, PLACE60054230, 

40 PLACE60054870, PLACE60062660, PLACE60087680, PLACE60 184870, PROST200 15210, PROST20024250, 
PROST20036350, PROST20050390, PROST20058860, PROST20063430, PROST20065790, PROST20084720, 
PROST20099090, PROST201 20070, PROST20 127450, PROST201 46590, PROST20152510, PROST201 68600, 
PUAEN1 0000650, PUAEN20003120, SKMUS20008730, SKMUS20017400, SKMUS20040440, SKMUS20073590, 
SKMUS20079150, SKNSH20009710, SMINT20002320, SMINT20007470, SMINT20008110, SMINT20011950, 

45 SMINT20016150, SMINT20026200, SMINT20030740, SMINT20049920, SMINT20077960, SMINT20083290, 
SMINT20086250, SMINT20089600, SMINT20091190, SPLEN20023540, SPLEN20024190, SPLEN20042200, 
SPLEN20043680, SPLEN20055600, SPLEN20057830, SPLEN20059270, SPLEN20063890, SPLEN20073500, 
SPLEN20080070, SPLEN20085910, SPLEN20090880, SPLEN20098030, SPLEN201 18050, SPLEN20 136730, 
SPLEN201 38600, SPLEN20139100, SPLEN201 39360, SPLEN201 80980, SPLEN201 87490, SPLEN201 93790, 

50 SPLEN20201830, TESTI1 0000190, TESTI20031310, TESTI20035790, TESTI20041630, TESTI20049060, 
TESTI20050720, TESTI20051200, TESTI20057430, TESTI20057590, TESTI20059080, TESTI200621 20, 
TESTI20067480, TESTI20071 630, TESTI20099350, TESTI201 051 30, TESTI201 0591 0, TESTI201 08060, 
TESTI201 25920, TESTI201 30530, TESTI20131440, TESTI201 34680, TESTI201 42540, TESTI20143180, 
TESTI201 50420, TESTI201 54370, TESTI201 64210, TESTI201 66670, TESTI201 68880, . TESTI20171070, 

55 TESTI201 82760, TESTI201 84750, TESTI201 93080, TEST 1201 94880, TESTI201 96970, TESTI201 97600, 
TESTI20201760, TESTI20207170, TESTI20219110, TESTI20228740, TESTI20244430, TESTI20246480, 
TESTI20251 740, TESTI20252690, TESTI20254030, TESTI2025791 0, TESTI20258720, TESTI20266050, 
TESTI20271790, TESTI20274960, TESTI20282530, TESTI20286590, THYMU 10004280, THYMU20006020, 
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THYMU20013250, THYMU20019260, THYMU20023560, THYMU20028150, THYMU20032820, THYMU20034400, 
THYMU20055460, THYMU20063650, THYMU20070250, THYMU20071120, THYMU20081110, THYMU2009Q230, 
THYMU20095920, THYMU20098350, THYMU20099060, THYMU201 20730, THYMU20121040, THYMU201 70080, 
THYMU201 85650, THYMU20191970, TKIDN 10000620, TKIDN1 0001 920, TRACH20011540, TRACH20091070, 
5 TRACH201 43710, TRACH20 170860, UTERU1 0001060, UTERU20026620, UTERU20041630, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20099510, UTERU201 01150, UTERU201 04310,' UTERU201 27030 , 
UTERU201 85220 

[0245] In particular, hit data of the following 328 clones for Swiss-Prot, or GenBank, UniGene, or nr corresponded 
to genes orproteins which had been deposited in the Online Mendelian Inheritance in Man (OMIM), which is the human 

10 gene and disease database, (the OMIM Number is shown in the parenthesis after the Clone Name). 

ADRGL20020290 (602193), ADRGL20021910 (605717), ADRGL20026790 (605046), ADRGL20036840 (142800), 
ADRGL20059610 (230800;230900;231000;231005), ADRGL20066770 (130660), ASTRO20038400 (604764), 
ASTRO20052420 (600888), ASTRO20055570 (1 76640;1 23400;1 37440;245300;600072), ASTRO200751 50 
(601896), ASTRO20088950 (603202;223000;223100), BNGH420008150 (600050), BNGH420086030 (118423), 

15 BRACE1 0000510 (148021), BRACE20003310 (603899), BRACE20069000 (204200), BRACE20097540 (604908), 
BRACE201 94670 (314375), BRACE20196180 (605535), BRACE20204670 (176884), 

BRACE20216950 (158070), BRAMY20003540 (602142), BRAMY20005080 (604735), BRAMY20035830 (603524), 
BRAMY20040580 (604077), BRAMY20043630 (602775), BRAMY20044920 (603486), BRAMY20051820 (604567), 
BRAMY20056620 (210210), BRAMY20089770 (602566), BRAMY20111780 (604077), BRAMY2015251 0 (176879), 

20 BRAMY201 90550 (600051), BRAMY20221600 (605789), BRAMY20227860 (605416), BRAMY20274510 (180475), 
BRAWH20082920 (603246), BRAWH20093040 (602989), BRAWH20095900 (602277), BRAWH201 90530 (605208), 
BRAWH20191980 (239500), BRCAN1 0000760 (111000), BRCAN10001050 (603696), BRCAN20005230 (603268), 
BRSSN20066440 (603430), CTONG20004520 (603817), CTONG20029030 (602775), CTONG20042640 (103390), 
CTONG20045500 (106195), CTONG20052780 (605612), CTONG20053990 (602187), CTONG20070780 (118990), 

25 CTONG20070910 (604450), CTONG20072930 (314995), CTONG20083980 (601703), CTONG20084660 (600834), 
CTONG20165750 (182465), CTONG20169040 (148030), CTONG20183430 (106410), CTONG20183830 (600382), 
CTONG20186290 (100660), CTONG20189000 (600888), DFNES20016470 (605952), DFNES20025500 (604581), 
DFNES20046840 (6026 17;241 850), DFNES20055400 (603456), DFNES20080880 (602273), FCBBF1 0000230 
(602327), FCBBF20035490 (602489), FCBBF20066340 (603560), FCBBF30002270 (142708), FCBBF30002280 

30 (176763), FCBBF30019140 (602120), FCBBF30053300 (600299), FCBBF30071500 (125485), FCBBF30072440 
(604455), FCBBF30076310 (176892), FCBBF30080730 (600572), FCBBF30100080 (602488), FCBBF30115920 
(603577), FCBBF30118670 (603640), FCBBF30129010 (601260), FCBBF30132050 (603018), FCBBF30136230 
(189909), FCBBF30153170 (171 860; 171 850), FCBBF301 64510 (603006), FCBBF301 66220 (182144), 
FCBBF30171230 (162151), FCBBF30175350 (602521), FCBBF30194550 (182900), FCBBF30220050 (600380), 

35 FCBBF30223210 (300022), FCBBF30263080 (194558), FCBBF30275590 (601403), FCBBF50001 650 (605268), 
FEBRA20027070 (314995), FEBRA20045380 (602942), FEBRA20046200 (106410), FEBRA2004651 0 (604077), 
FEBRA20057010 (602187), FEBRA20063720 (603899), FEBRA20078800 (601825;256000), FEBRA20087550 
(600811), FEBRA20088810 (603725), FEBRA20090160 (600137), FEBRA20092760 (602567), FEBRA20170240 
(314997), FEBRA201 73330 (602990), FEBRA20191720 (603895), HCHON1 00001 50 (300163), HCHON20015050 

40 (151510), HEART20009590 (604581), HEART20022200 (601870), HEART200631 00 (602422), HHDPC20081230 
(164035), HLUNG20008460 (300108), HLUNG20014590 (604077), HLUNG20032460 (176785), HLUNG20063700 
(600210), HLUNG20065990 (186591), 

HLUNG20069350 (114212), HLUNG20081530 (162230), HLUNG20082350 (604677), HLUNG20083330 (120180), 
HLUNG20085210 (604464), KIDNE200811 70 (604535), KIDNE20084040 (602382), KIDNE20088240 (605084), 

45 KIDNE20089870 (602922), KIDNE20133460 (605430), KIDNE20134890 (117143), KIDNE20141700 (312760), 
KIDNE20142900 (188040), KIDNE20150730 (179715), KIDNE20152440 (602194), KIDNE20160360 (602488)! 
KIDNE201 65390 (604649), KIDNE20169180 (191845), KIDNE201 73430 (603831), KIDNE201 89960 (275360), 
LIVER20026440 (601270), MESAN20006200 (151740), MESAN20021130 (600050), MESAN20033220 (600466), 
MESAN20056890 (600813), MESAN20057240 (126380), MESAN20065990 (601959), M ESAN20067430 (191010), 

50 MESAN20069530 (604362), NESOP20004520 (153432), NT2NE20018890 (606031), NT2NE20026200 (277730), 
NT2NE20037050 (300028), NT2NE20053950 (604078), NT2NE20061030 (600834), NT2NE20111190 (602619)! 
NT2NE20117580 (601825;256000), NT2NE20119980 (191161), NT2NE20140130 (601281), NT2NE20141040 
(602917), 

NT2RI20093010 (172460), NT2RP70003110 (1301 60; 194050), NT2RP7004641 0 (601930), NT2RP70075300 
55 (601856), NTONG20032100 (148065; 193900), NTONG20034540 (602658), OCBBF20000740 (602059), 
OCBBF20012520 (602059), OCBBF2011 1600 (147625), OCBBF2012001 0 (605008), OCBBF201 56450 (314997), 
OCBBF201 57970 (604077), OCBBF20191950 (192977), PEBLM20001800 (146900), PEBLM20003260 (194558)! 
PLACE60004260 (601891), PLACE60012620 (214500), PLACE60054230 (300108), PLACE60054870 (160776), 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



PLACE60062660 (606004), 

PLACE60087680 (146732), PLACE601 84870 (172425), PROST20015210 (160745), PROST20024250 (604078), 
PROST20036350 (138295), PROST20050390 (601258), PROST20058860 (182282), PROST20063430 (603292), 
PROST20065790 (171840), PROST20084720 (604426), PROST20099090 (602714), PROST20120070 (602809), 
PROST20127450 (602960), PROST20146590 (158340;1 13720), PROST20152510 (603367), PROST20168600 
(604415), PUAEN10000650 (602960), PUAEN20003120 (601573), SKMUS20008730 (602127), SKMUS20017400 
(191030), SKMUS20040440 (604163), SKMUS20073590 (605834), SKMUS20079150 (605596), SKNSH20009710 
(191030;164970), SMINT20002320 (601644), SMINT20007470 (190370), SMINT20008110 (604384), 
SMINT20011950 (603430), SMINT20016150 (134790), SMINT20026200 (159556), SMINT20030740 (604078), 
SMINT20049920 (600417), SMINT20077960 (137350; 1051 20), SMINT20083290 (146900), SMINT20086250 
(238330), SMINT20089600 (605926), SMINT20091190 (146900), SPLEN20023540 (605577), SPLEN20024190 
(601548), SPLEN20042200 (604167), SPLEN20043680 (126340;234050;278730), SPLEN20055600 (194541), 
SPLEN20057830 (179715), SPLEN20059270 (602165), SPLEN20063890 (600245), SPLEN20073500 (603300), 
SPLEN20080070 (230000), SPLEN20085910 (603424), SPLEN20090880 (142800), SPLEN20098030 (601742), 
SPLEN20118050 (301870), SPLEN20136730 (605412), SPLEN20138600 (603728), SPLEN20139100 (147120), 
SPLEN20139360 (117140), SPLEN201 80980 (156560), SPLEN20187490 (179838), SPLEN20193790 (147150), 
SPLEN20201830 (301870), TESTI10000190 (158340;1 13720), 

TESTI20031310 (107280), TESTI20035790 (601940), TESTI20049060 (603889), TESTI20050720 (245050), 
TESTI20051200 (602273), TESTI20057430 (194532), TESTI20057590 (601890), TESTI20059080 (604038), 
TESTI20062120 (604212), TESTI20067480 (602277), TESTI20071630 (602692), TESTI20099350 (160776), 
TESTI20105130 (310400), TESTI20105910 (601328), TESTI20108060 (600590), TESTI20125920 (601934), 

TESTI20131440 (114850), TESTI20134680 (117143), TESTI20142540 (137960), 
TESTI201 50420 (602732), TESTI201 54370 (600936), TESTI201 64210 (602319), 
TESTI20168880 (151410), TESTI201 71070 (604064), TESTI20182760 (601328), 
TESTI20193080 (602128), TESTI20194880 (602260), TESTI20196970 (601117), 



TESTI201 30530 (146680) 

TESTI20143180 (117143), 

TESTI201 66670 (142968) 

TESTI201 84750 (150320) 

TESTI201 97600 (604167), TESTI20201760 (602162) 



TESTI20207170 (480100), TESTI20219110 (601890), 
TESTI20228740 (604027), TESTI20244430 (182900), TESTI20246480 (601486), TESTI20251 740 (602731), 
TESTI20252690 (601368), TESTI20254030 (602330), TESTI20257910 (142871), TESTI20258720 (182900), 
TESTI20266050 (109092), TESTI20271 790 (604678), TESTI20274960 (194558), TESTI20282530 (604077), 
TESTI20286590 (147267), THYMU10004280 (602290), THYMU20006020 (601149), THYMU20013250 (601988), 
THYMU20019260 (603899), THYMU20023560 (142765), THYMU20028150 (190197), THYMU20032820 (604077), 
THYMU20034400 (604449), THYMU20055460 (133280), THYMU20063650 (180480), THYMU20070250 (277730), 
THYMU20071120 (603899), THYMU20081110 (602567), THYMU20090230 (602324), THYMU20095920 (605349), 
THYMU20098350 (148040;131760;131800;131900), THYMU20099060 (146900), THYMU20120730 (100660), 
THYMU20121040 (130592), THYMU20170080 (604964), THYMU20185650 (602121 ; 124900), THYMU20191 970 
(604265), TKIDN10000620 (605072), TKIDN10001 920 (603899), TRACH20011540 (191155), TRACH20091 070 
(100660), TRACH20143710 (601905), TRACH20170860 (147170), UTERU10001060 (311040), UTERU20026620 
(314997), UTERU20041630 (602277), 

UTERU20086530 (173310), UTERU20087070 (216950), UTERU20087850 (605248), UTERU2009951 0 (604077), 
UTERU20101150 (164012), UTERU20104310 (604414), UTERU20127030 (150325), UTERU20 185220 (600014) 
[0246] The clones predicted to belong to the category of enzyme and/or metabolism-related protein are the following 
219 clones. 

ADRGL20059610, ASTRO20026320, ASTRO20050810, ASTRO20088950, BNGH420008150, BNGH420035290, 
BNGH420074600, BRACE20050870, BRACE20097540, BRACE20200770, BRACE20204670, B R AC E202 15410, 
BRAMY20003540, BRAMY20005080, BRAMY20027990, BRAMY20028620, BRAMY20044920, BRAMY20055760, 
BRAMY20056620, BRAMY20072870, BRAMY20093490, BRAMY20096930, BRAMY201 18490, BRAMY20 125360, 
BRAMY201 43870, BRAMY20 152510, BRAMY20231150, BRAMY20244490, BRAMY20251210, BRAWH20021910, 
BRAWH20082920, BRAWH 20 093040, BRAWH20094900, BRAWH20183170, BRAWH201 88750, BRAWH201 90550, 
BRAWH20191980, BRCAN20005230, BRCOC20003600, CTONG20051100, CTONG20070910, CTONG20076810, 
CTONG20079590, CTONG20080140, CTONG20085210, CTONG201 86290, DFNES20063460, DFNES20080880, 
FCBBF20023490, FCBBF20066340, FCBBF30004340, FCBBF30019140, FCBBF30022680, FCBBF30029250, 
FCBBF30072440, FCBBF30076310, FCBBF30085560, FCBBF30091520, FCBBF301 07290, FCBBF30 125880, 
FCBBF301 32050, FCBBF30 143550, FCBBF30153170, FCBBF301 66220, FCBBF30171230, FCBBF30 175350, 
FCBBF30236670, FCBBF30260480, FEBRA20038220, FEBRA20040560, FEBRA20078800, FEBRA20090160, 
FEBRA201 72230, FEBRA201 73330, HCHON20000870, HCHON20002710, HEART1 0001 490, HEART20022200, 
HEART20047640, HEART20082570, HLUNG20011260, HLUNG20032460, HLUNG20041540, HLUNG20042730, 
HLUNG20054790, KIDNE20080690, KIDNE20083620, KIDNE20084040, KIDNE20147170, KIDNE20 152440, 
KIDNE201 731 50, KIDNE201 861 70, KIDNE201 89960, LIVER2001 1 640, LIVER20026440, LIVER20055270, 
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MESAN20021130, MESAN20033220, MESAN20038520, ME5AN20057240, MESAN20058110, MESAN20065990, 
MESAN20095800, NT2NE20026200, NT2NE20042550, NT2NE201 17580, NT2NE201 27900, NT2R 120093010, 
NT2RP70064570, NTONG20034540, NTONG20043080, NTONG20053630, NTONG20053730, NTONG20058010, 
OCBBF201 20010, OCBBF201 67290, OCBBF20191950, PANCR1 0000860, PLACE60052940, PLACE60064180, 

5 PLACE60073090, PLACE60095600, PLACE601 84410, PLACE601 88630, PROST20007600, PROST20033240, 
PROST20036350, PROST20039300, PROST20050390, PROST20051310, PROST20052850, PROST20065790, 
PROST20075280, PROST20084720, PROST20099090, PROST201 08850, PROST201 52510, PUAEN20001520, 
PUAEN20002470, SKNMC20006350, SKNSH20057920, SMINT20008110, SMINT20049920, SM1NT20094680, 
SPLEN20023540, SPLEN20024930, SPLEN20043680, SPLEN20048800, SPLEN20054500, SPLEN20057900, 

10 SPLEN20071820, SPLEN20080070, SPLEN20085910, SPLEN201 08000, SPLEN201 36730, SPLEN201 80980, 
TESTI2001 2080, TESTI20030200, TESTI20031 31 0, TESTI20038240, TESTI20050720, TESTI2005 1 200, 
TESTI20059080, TESTI20062120, TESTI20066330, TESTI20076570, TESTI201 03690, TESTI201 05130, 
TESTI201 06820, TESTI201 08060, TESTI201 1 2860, TESTI201 21 040, TESTI201 30530, TESTI201 3 1 440, 
TESTI201 68880, TESTI201 701 70, TESTI201 96690, TESTI201 96970, TESTI201 991 1 0, TESTI20205250, 

15 TESTI20212970, TESTI20222030, TESTI20226520, TESTI20227380, TESTI 20244460, TESTI20244730. 
TESTI20250630, TESTI20260640, TESTI20262940, TESTI20264530, TESTI20285230, THYMU20006020, 
THYMU20013250, THYMU20034400, THYMU20039320, THYMU20055460, THYMU20055760, THYMU20063650, 
THYMU20066660, THYMU20070250, THYMU20087270, THYMU20096580, THYMU20 100940, THYMU201 10720, 
THYMU201 20240, THYMU201 20730, THYMU20 170230, TRACH20011 010, TRACH20021380, TRACH20091070, 

20 TRACH201 13020, TRACH20 143710, TRACH20164100, TRACH201 90460, UTERU20087070, UTERU20089620, 
UTERU201 04310, UTERU20 185220, UTERU201 88670 

[0247] The clones predicted to belong to the category of cell division and/or cell proliferation-related protein are the 
following 44 clones. 

ASTRO20090680, BRACE20079370, BRAMY20234820, BRCAN1 0001 050, BRCAN2000541 0, CTONG20032930, 
25 FCBBF20070950, FCBBF30002270, FCBBF30053300, FCBBF301 05860, FCBBF301 75350, FCBBF302 15240, 
FCBBF30275590, FEBRA20045380, HLUNG20068120, KIDNE201 34890, KIDNE201 50730, MESAN20021470, 
NT2NE20077250, NT2NE20 153620, NT2RP70030840, NTONG20053910, OCBBF20111370, OCBBF20 174580, 
PROST20063430, SKNMC1 0001 230, SMINT20028800, SPLEN20023540, SPLEN20057830, SPLEN201 39360, 
TESTI20031 41 0, TESTI20057840, TESTI20065650, TESTI20066650, TESTI201 07320, TESTI20 1 08060, 
30 TESTI201 14480, TESTI20 134680, TESTI20143180, TESTI201 50920, TESTI20201760, TESTI20278280, 
TESTI20284260, THYMU20097920 

[0248] The clones predicted to belong to the category of cytoskeleton- related protein are the following 80 clones. 
ADRGL20062330, ASTRO20053430, BGGI1 20000670, BRACE20079370, BRAMY20038980, BRAMY20083330, 
BRAMY20094890, CTONG20004110, CTONG20032930, CTONG20077760, CTONG20083980, CTONG20 169040, 

35 CTONG201 83430, DFNES20018000, FCBBF301 05860, FCBBF301 30410, FCBBF301 94550, FCBBF30201630, 
FCBBF30271990, FEBRA20005040, FEBRA20046200, FEBRA20099860, HCHON20015050, HLUNG20081530, 
KIDNE20081170, NT2RP70001 730, NT2RP70003110, NTONG200321 00, OCBBF201 66890, OCBBF201 74890, 
PLACE60054870, PLACE 60 055590, PLACE60071800, PLACE601 18810, PROST20015210, PROST20097840, 
PROST201 20070, PROST20 146590, SKMUS20007260, SKMUS20008730, SKMUS20017400, SKMUS20073590, 

40 SMINT20062050, SMINT20074330, SMINT20077960, SPLEN20039180, SPLEN20049840, SPLEN20076470, 
SPLEN201 82990, SPLEN201 87490, SPLEN201 95710, TEST1 10000 190, TESTI20041630, TESTI2005788o! 
TESTI20058920, TESTI20060080, TESTI20064530, TESTI20064650, TESTI20065650, TESTI20067440, 
TESTI20071 1 30, TESTI20099350, TESTI201 1 2540, TESTI201 25280, TESTI201 3601 0, TESTI201 5331 0, 
TESTI201 75370, TESTI20222460, TESTI 20244430, TEST120254030, TESTI20258720, THYMU20024500, 

45 THYMU20062610, THYMU20098350, TRACH20043360, TRACH20098510, TRACH201 49500, UTERU20089390, 
UTERU201 22520, UTERU20168960 

[0249] The clones predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein are 
the following 70 clones. 

ASTRO20026320, BRACE20050870, BRACE20200770, BRAMY201 34050, BRAWH20063010, BRAWH20093040, . 

50 BRAWH20 174330, BRAWH20 176850, CTONG20042640, FCBBF20023490, FCBBF20035490, FCBBF20070950, 
FCBBF30002270, FCBBF30048420, FCBBF30080730, FCBBF301 15920, FCBBF30236670, FEBRA20035240, 
FEBRA20092760, FEBRA201 73330, HHDPC20081230, HLUNG20011460, HLUNG20068120, KIDNE20089870, 
KIDNE201 50730, MESAN20056890, MESAN20057240, NT2NE20037050, NT2NE201 67660, NT2RP7003107o| 
NTONG20053730, PLACE600641 80, PLACE60095600, PROST200 16760, PROST20051310, PROST2005886o| 

55 PROST201 52510, PUAEN20002470, SKMUS20079150, SKNSH20030640, SPLEN20023850, SPLEN20057830, 
SPLEN201 39360, SPLEN201 90430, TESTI20006830, TESTI20030200, TESTI20031410, TESTI2003579o| 
TESTI20062120, TESTI20065650, TESTI20081890, TESTI201 50920, TESTI201 53310, TESTI2020176o! 
TESTI2021 2970, TESTI20227380, TESTI20251 740, TESTI20256560, TESTI20260640, TESTI202701 30, 
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TESTI20284260, TESTI20285230, THYMU20021090, THYMU20049060, THYMU20066660, THYMU20081110, 
THYMU20090230, THYMU201 20240, UTERU 10001 060, UTERU20104310 

[0250] The clones predicted to belong to the category of protein synthesis and/or transport-related protein are the 
following 20 clones. 

5 BRAMY20038980, BRAMY20274510, CTONG20008190, CTONG20033610, FCBBF20018680, FEBRA20090220, 
KIDNE20141700, NT2NE20 167660, NTONG20055200, PLACE60012620, PROST20036350, PROST20062820, 
SKMUS20040440, SMINT20000070, SPLEN201 80980, TESTI20055680, TESTI20067440, TESTI201 07240, 
THYMU20096580, THYMU20121040 

[0251 ] The clones predicted to belong to the category of cellular defense-related protein are the following 1 0 clones. 
10 ASTRO20089600, BRAMY20117670, FEBRA20087550, HLUNG20081390, MESAN20057240, NTONG20031580, 
PROST20007600, SPLEN20023850, SPLEN20043680, TESTI20261680 

[0252] The clones predicted to belong to the category of development and/or differentiation-related protein are the 
following 19 clones. 

BRACE20061620, BRACE20200770, BRAMY20013670, CTONG200 17490, CTONG20020950, HCHON1 00001 50, 
15 MESAN20021470, OCBBF201 65910, PROST20 155370, PUAEN20002470, TESTI20079220, TESTI20079980, 
TESTI201 66670, TESTI20 184760, TESTI20252690, TRACH20040390, UTERU20089620, UTERU20094830, 
UTERU201 69020 

[0253] The clones predicted to belong to the category of DNA-binding and/or RNA-binding protein are the following 
168 clones. 

20 ASTRO20038400, BGG1 12001 0750, BNGH420070370, BRACE20003310, BRACE20061620, BRAMY20001510, 
BRAMY20040580, BRAMY20076100, BRAMY20111780, BRAMY20274510, BRAWH20040680, BRAWH20050740, 
BRAWH20063010, BRAWH20080580, BRAWH20095900, BR AWH20 174330, BRSSN20066440, CTONG20020950, 
CTONG20044230, CTONG20053990, CTONG20072930, CTONG20074000, CTONG201 65750, CTONG201 86370, 
CTONG201 86520, DFNES20046840, DFNES20073320, FCBBF20035430, FCBBF20070950, FCBBF30002270, 

25 FCBBF30003610, FCBBF30019140, FCBBF30021 900, FCBBF30048420, FCBBF30080730, FCBBF30093170, 
FCBBF301 14850, FCBBF301 29010, FCBBF301 36230, FCBBF30220050, FCBBF30228940, FCBBF30236670, 
FCBBF30263080, FCBBF30285930, FCBBF50003530, FEBRA20026820, FEBRA20027070, FEBRA20035240, 
FEBRA20046510, FEBRA2005701 0, FEBRA20063720, FEBRA20087550, FE BR A2 0092760, FEBRA201 70240, 
FEBRA201 77800, HCHON20002650, HEART20019310, HEART20063100, HHDPC20081230, HLUNG20011460, 

30 HLUNG20014590, HLUNG20028110, HLUNG20063700, HLUNG20068120, KIDNE201 40870, LIVER20006260, 
MESAN20016270, MESAN20056890, MESAN20057240, NT2NE20038870, NT2NE20053950, NT2NE20060750, 
NT2NE20079670, NT2NE20082600, NT2NE20087270, NT2RP70029780, NT2RP70046410, NT2RP70057500, 
NT2RP70075300, NT2RP70090870, OCBBF201 16250, OCBBF201 20950, OCBBF20121910, OCBBF201 56450, 
OCBBF201 57970, OCBBF201 66900, OCBBF201 75360, OCBBF20 177540, PEBLM1 0001 470, PEBLM20003260, 

35 PLACE60066970, PLACE601 22970, PLACE601 77880, PROST20007170, PROST20024250, PROST20035170, 
PROST20051310, PROST20058860, PROST20151370, PROST20 155370, PUAEN20003120, SMINT20011950, 
SMINT20030740, SMINT20039050, SMINT20044140, SMINT20086720, SPLEN20042200, SPLEN20043680, 
SPLEN20055600, SPLEN20059270, SPLEN20063250, SPLEN20139360, SPLEN201 90430, TESTI1 0001 570, 
TESTI20006830, TESTI20030200, TESTI20031 41 0, TESTI20035790, TESTI20057430, TESTI2005981 0, 

40 TESTI20062120, TESTI20067480, TESTI20068790, TESTI20075240, TESTI20079220, TESTI20088840, 
TESTI201 04090, TESTI201 34970, TESTI201 66670, TESTI201 71 070, TESTI201 73960, TESTI201 84760, 
TESTI201 97600, TESTI20201 760, TESTI2021 2970, TESTI20227380, TESTI20228740, TESTI20246480, 
TESTI20254030, TESTI20254990, TESTI20266050, TESTI20268240, TESTI202701 30, TESTI20274960, 
TESTI20282530, TESTI20284260, TESTI20285230, THYMU 10004280, THYMU20019260, THYMU20023560, 

45 THYMU20032820, THYMU20049060, THYMU20066660, THYMU20071120, THYMU20077250, THYMU20081110, 
THYMU20090230, TKIDN1 0001 920, TRACH20 108240, UTERU10001060, UTERU20026620, UTERU20041630, 
UTERU20094830, UTERU20099510, UTERU201 01150, UTERU201 69020, UTERU20177150, UTERU201 88670 
[0254] The clones predicted to belong to the category of ATP binding and/or GTP-binding protein are the following 
93 clones. 

50 ASTRO20026320, BNGH420035290, BRACE20050870, BRACE20079370, BRACE20200770, BRAMY20055760, 
BRAMY201 18490, BRAMY20244490, BRAMY20251210, BRAWH20093040, BRAWH201 90550, BRCAN1 0001 050, 
BRCOC20003600, CTONG20008190, CTONG20030280, CTONG20032930, CTONG20 176040, CTONG20 184830, 
FCBBF20023490, FCBBF30019140, FCBBF30076310, FCBBF301 05860, FCBBF301 75350, FCBBF30201630, 
FCBBF30236670, FEBRA20005040, FEBRA20090160, FEBRA20 173330, HCHON20000870, HLUNG20011260, 

55 HLUNG20052300, KIDNE20081170, KIDNE20 134890, LIVER20030650, LIVER20055270, MESAN20065990, 
NT2NE20042550, NTONG20043080, NTONG20055200, OCBBF20 182060, PLACE60054870, PLACE60064180, 
PLACE60095600, P LAC E60 140640, PROST20015210, PROST20033240, PROST20036350, PROST20051310, 
PROST20052850, PROST20062820, PROST20075280, PROST201 20070, PUAEN20002470, SKNSH20052400, 
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SKNSH20057920, SMINT20008110, SPLEN20023850, SPLEN20043680, SPLEN20049840, SPLEN201 36730, 
SPLEN201 80980, SPLEN201 93790, TESTI20055680, TESTI20058920, TESTI20060080, TESTI20064650, 
TESTI20071 1 30, TESTI20099350, TESTI201 06820, TESTI201 1 2860, TESTI201 34680, TESTI201 3601 0, 
TESTI20143180, TESTI201 75370, TESTI202 12970, TESTI20222460, TESTI20227380, TESTI20244220, 
5 TESTI20244460, TESTI2 0264530, THYMU20013250, THYMU20039320, THYMU2006261 0, THYMU20066660, 
THYMU20087270, THYMU20096580, THYMU20 100940, THYMU201 76010, TRACH20043360, TRACH20098510, 
TRACH201 13020, UTERU201 85220, UTERU201 88670 

[0255] Among the clones other than the ones shown above, BNGH42003641 0 and FCBBF30257370 are clones 
which were predicted to highly possibly belong to the category of secretory protein and/or membrane protein based 
10 on the result of domain search by Pfam. 
SMINT20044730, TESTI20140970 

[0256] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
glycoprotein-related protein based on the result of domain search by Pfam. 

BRACE20055560, CTONG20046690, DFNES20043710, FCBBF30005500, ME SAN 20030350, MESAN20030370, 
15 PLACE60074820, TESTI20058350, TESTI20106170, TRACH20131230, UTERU20000950 

[0257] The 11 clones shown above are clones which were predicted to highly possibly belong to the category of 
signal transduction-related protein based on the result of domain search by Pfam. 

ASTRO20010290, BRACE20099070, CTONG20007660, DFNES20076340, DFNES20094820, FCBBF301 25460, 
FCBBF301 42290, FCBBF301 69280, FEBRA20031 000, NT2NE20026510, NT2RP70031340, PLACE50001390, 
20 SPLEN20135030, TESTI20046890, TESTI20060350, TESTI20 166290, TESTI20259110, THYMU20 184550 

[0258] The 18 clones shown above are clones which were predicted to highly possibly belong to the category of 
transcription-related protein based on the result of domain search by Pfam. 

ADRGL20047770, ADRGL20079060, BRACE200 14450, BRACE20051600, BRAWH 20 185260, CTONG20033750, 
CTONG20070090, CTONG201 90290, FCBBF20020440, FCBBF30 005360, FCBBF301 73960, FEBRA20031000, 
25 KIDNE20087880, LIVER20013890, MESAN20030350, MES AN 20030370, OCBBF20113110, PLACE60074820, 
PLACE60093380, PROST20028970, PROST20102190, SALGL1 0001070, SPLEN20006950, SPLEN20011350, 
SPLEN20050090, TESTI20060830, TESTI20066150, TESTI201 20900, TESTI2013231 0, TESTI201 48380, 
TESTI201 62980, TESTI201 66290, TESTI20205100, THYMU20112590, TRACH20029880 

[0259] The 35 clones shown above are clones which were predicted to highly possibly belong to the category of 
30 enzyme and/or metabolism-related protein based on the result of domain search by Pfam. 
PLACE60054820, TESTI201 97030 

[0260] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
cell division and/or cell proliferation-related protein based on the result of domain search by Pfam. 
ASTRO20006530, OCBBF2001 6390, TRACH20058000 
35 [0261] The three clones shown above are clones which were predicted to highly possibly belong to the category of 
cytoskeleton-related protein based on the result of domain search by Pfam. 
BRACE20065470, PLACE60054820 

[0262] The two clones shown above are clones which were predicted to highly possibly belong to the category of 
nuclear protein and/or RNA synthesis-related protein based on the result of domain search by Pfam. 
40 ASTRO20010290, BRACE20099070, BRAWH20014590, CTONG20007660, DFNES20076340, DFNES20094820, 
FCBBF301 25460, FCBBF30 142290, FCBBF301 69280, FEBRA20031000, MESAN20034440, NT2NE20026510, 
NT2RP70031340, P LAC E5000 1390, SPLEN201 35030, TESTI20046890, TESTI20060350, TESTI201 66290, 
TESTI20259110, THYMU201 04480, THYMU20 184550 

[0263] The 21 clones shown above are clones which were predicted to highly possibly belong to the category of 
45 DNA-binding and/or RNA-binding protein based on the result of domain search by Pfam. 
KIDNE20133880, MESAN20030350, MESAN20030370, TESTI20059480 

[0264] The four clones shown above are clones which were predicted to highly possibly belong to the category of 
ATP-binding and/or GTP-binding protein based on the result of domain search by Pfam. 

[0265] The 205 clones shown below are clones which were unassignable to any of the above-mentioned categories, 
50 but have been predicted to have some functions based on homology search for their full-length nucleotide sequences 

and motif search in their deduced ORFs. Clone Name, Definition in the result of homology search or Motif Name in 

the motif search, demarcated by a double slash mark (II), are shown below. 

ADRGL20022600//DIAPHANOUS PROTEIN HOMOLOG 1 (P140MDIA). 

ADRGL20023920//ABC1 PROTEIN HOMOLOG PRECURSOR. 
55 ASTRO20001 91 0//Rattus norvegicus mRNA for annexin V-binding protein (ABP-1 0), partial cds. 

ASTRO20009140//PUTATIVE COMPETENCE-DAMAGE PROTEIN. 

ASTRO20046280//PSU1 PROTEIN. 

ASTRO20058960//DNA damage inducible protein homolog - fission yeast (Schizosaccharomyces pombe) 
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BNGH420024870//C2 domain// C2 domain// C2 domain 

BRACE20007330//RING CANAL PROTEIN (KELCH PROTEIN). 

BRACE20052430//Homo sapiens AMSH mRNA, complete cds. 

BRACE20054600//Xenopus laevis mRNA for Kielin, complete cds. 
5 BRACE2005981 0//TSC-22/dip/bun family 

BRACE20063540//MEROZOITE SURFACE PROTEIN CMZ-8 (FRAGMENT). 

BRACE20079200//Xenopus laevis mRNA for Kielin, complete cds. 

BRAMY20016780//Proprotein convertase P-domain 

BRAMY20023640//UBX domain 
10 BRAMY20045420//Domain found in Dishevelled, Egl-10, and Pleckstrin 

BRAMY20056840//UBE-1C2 

BRAMY20063750//Homo sapiens HRIHFB2007 mRNA, partial cds. 
BRAMY20102900//Homo sapiens RU1 (RU1) mRNA, complete cds. 
BRAMY201 58550//CALMODULIN. 
is BRAMY20223010//Mus musculus leucine-rich glioma-in activated 1 protein precursor, (Lgi1) mRNA, complete cds. 
BRAMY20238630//TETRATRICOPEPTIDE REPEAT PROTEIN 4. 
BRAMY20245760//Araneus diadematus fibroin-4 mRNA, partial cds. 
BRAWH20047790//HMG (high mobility group) box 

BRSSN2000561 0//Mus musculus semaphorin cytoplasmic domain -associated protein 3A (Semcap3) mRNA, complete 
20 cds. 

BRSSN20005660//Bacterial type II secretion system protein 

BRSSN20093890//Homo sapiens mRNA for Kelch motif containing protein, complete cds. 
CTONG20041150//Streptomyces ansochromogenes strain 7100 SanE (sanE) gene, complete cds. 
CTONG200661 1 0//Homo sapiens DEME-6 mRNA, partial cds. 
25 CTONG20069420//Ribosomal protein S1 4p/S29e 
CTONG2007I040//BETA CRYSTALLIN B2 (BP). 
CTONG20074170//DENN (AEX-3) domain 
CTONG20083430//Nuclear transition protein 2 

CTONG20170940//MYOTROPHIN (V-1 PROTEIN) (GRANULE CELL DIFFERENTIATION PROTEIN). 
30 CTONG20174290//TRICHOHYALIN. 

CTONG20174580//Homo sapiens mRNA for vascular Rab-GAP/TBC-containing protein complete cds. 

CTONG201 80690//Collagen triple helix repeat (20 copies) 

CTONG20186550//cca3 protein - rat 

CTONG20188080//TPR Domain 
35 FCBBF1 0004760//Homo sapiens GAP-like protein (N61 ) mRNA, complete cds. 

FCBBF20033360//RING CANAL PROTEIN (KELCH PROTEIN). 

FCBBF20041380//SAM domain (Sterile alpha motif) 

FCBBF20043730//UBA domain 

FCBBF20056580//Mus musculus NSD1 protein mRNA, complete cds. 

40 FCBBF20059660//TPR Domain 

FCBBF300 1 9 1 80//SE R IN E ITH REON I N E PROTEIN PHOSPHATASE 2A, 65 KDA REGULATORY SUBUNIT A, AL- 
PHA ISOFORM (PP2A, SUBUNIT A, PR65-ALPHA ISOFORM) (PP2A, SUBUNIT A, R1 -ALPHA ISOFORM). 
FCBBF30026580//Homo sapiens retinoblastoma-associated protein RAP140 mRNA, complete cds. 
FCBBF30035570//C2 domain 

4 5 FCBBF30079770//D-isomer specific 2-hydroxyacid dehydrogenases 

FCBBF301 001 20//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
cds. 

FCBBF30100410//Mus musculus testis-specific Y-encoded-like protein (Tspyll) mRNA, complete cds. 
FCBBF30118890//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 
50 FCBBF30138000//trg protein - rat 

FCBBF30157270//Rattus norvegicus PAPIN mRNA, complete cds. 

FCBBF301 61 780//gag gene protein p24 (core nucleocapsid protein)// Zinc knuckle 

FCBBF301 98670//dof protein - fruit fly (Drosophila melanogaster) 

FCBBF30222910//Mus musculus Rap2 interacting protein 8 (RPIPB) mRNA, complete cds. 
55 FCBBF30255680//Rattus norvegicus brain specific cortactin-binding protein CBP90 mRNA, partial cds. 
FCBBF30260210//Drosophila melanogaster KISMET-L long isoform (kis) mRNA, complete cds. 
FCBBF30282020//cca3 protein - rat 

FCBBF40000610//late gestation lung 2 protein [Rattus norvegicus]. 
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FEBRA20029620//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat 
FEBRA20031150//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 
FEBRA20038330//Corticotropin-re!easing factor family 
FEBRA20038970//Homo sapiens mRNA for stabilin-1 (stabl gene). 
FEBRA2008861 0//CELLULAR RETINALDEHYDE-BINDING PROTEIN (CRALBP). 
FEBRA201 50420//HYPOTHETICAL 131.5 KDA PROTEIN C02F12.7 IN CHROMOSOME X. 
FEBRA201 75330//D-isomer specific 2-hydroxyacid dehydrogenases 
HEART10001420//Mus musculus skm-BOP1 (Bop) mRNA, complete cds. 
HLUNG20024050//Rubredoxin 

HLUNG20030420//Mus musculus mRNA for MAIL, complete cds. 
HLUNG20030490//Ambystoma tigrinum RPE65 protein mRNA, complete cds. 
HLUNG20033060//Homo sapiens GAP-iike protein (N61) mRNA, complete cds. 
HLUNG20041590//ubiquitous tetratricopeptide containing protein RoXaN [Homo sapiens]. 
HLUNG20045340//MOB2 PROTEIN (MPS1 BINDER 2). 
HLUNG20051330//FHIPEP family 

HLUNG20070410//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
HLUNG20072100//Gallus gallus Dach2 protein (Dach2) mRNA, complete cds. 
HLUNG20083480//Chicken mRNA for TSC-22 variant, complete cds, clone SLFEST52. 
KIDNE20027980//SAM domain (Sterile alpha motif) 

KIDNE20084730//Homo sapiens FH1/FH2 domain-containing protein FHOS (FHOS) mRNA, complete cds. 
KIDNE20149780//NG28 [Mus musculus] 

KIDNE20154330//Rattus norvegicus mRNA for multi PDZ domain protein. 

KIDNE20170400//Leucine Rich Repeat// Leucine Rich Repeat// Leucine Rich Repeat// Protein kinase C terminal do- 
main// Rubredoxin 

KIDNE20189890//Homo sapiens mRNA for KARP-1 -binding protein 2 (KAB2), complete cds. 
LIVER20010760//Homo sapiens C-type lectin-like receptor-1 mRNA, complete cds. 
LIVER20040740//RETINAL-BINDING PROTEIN (RALBP). 
MESAN20009090//Homo sapiens CEGP1 protein (CEGP1), mRNA 
MESAN20026870//PAN domain// TBC domain 
MESAN20090190//CEGP1 protein [Homo sapiens]. 

NT2NE20059680//Homo sapiens integrin cytoplasmic domain associated protein (lcap-1a) mRNA, complete cds. 
NT2NE20077270//Adenovirus EB1 55K protein / large t-antigen 

NT2NE20087850//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT). 
NT2NE20095230//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds. 
NT2NE20108420//KES1 PROTEIN. 

NT2NE20173970//Rattus norvegicus beta-catenin binding protein mRNA, complete cds. 
NT2NE20177210//Leishmania major partial ppgl gene for proteophosphoglycan. 
NT2RP70012830//CALPHOTIN. 

NT2RP70035110//Caenorhabditis elegans UNC-89 (unc-89) gene, complete cds. 
NTONG20002230//Mus musculus RW1 protein mRNA, complete cds. 
NTONG20005310//Ribosomal protein S9/S16 
NTONG20029850//CALCYPHOSINE (R2D5 ANTIGEN). 
NTONG20035150//RING CANAL PROTEIN (KELCH PROTEIN). 
NTONG20058220//Homo sapiens phosphoprotein pp75 mRNA, partial cds. 
OCBBF20005220//Rattus norvegicus Fos-related antigen mRNA, complete cds. 
OCBBF20011860//Mus musculus epithelial protein lost in neoplasm-a (Eplin) mRNA, complete cds. 
OCBBF20016810//enhancer of polycomb [Mus musculus] 
OCBBF20147070//DNA polymerase (viral) C-terminal domain 
OCBBF20160380//liver stage antigen LSA-1 - Plasmodium falciparum 
OCBBF201 7791 0//Corticotropin-releasing factor family 
PEBLM20005020//Virion host shutoff protein 

PLACE60055460//Homo sapiens leucine-zipper protein FKSG13 (FKSG13) mRNA, complete cds. 
PLACE60068710//SUPPRESSOR PROTEIN SRP40. 
PLACE60080360//mucin [Homo sapiens] 
PLACE60082850//Pathogenesis-related protein Bet v I family 

PLACE60098350//Human hepatocellular carcinoma associated protein (JCL-1) mRNA, complete cds. 
PLACE60105680//Homo sapiens mRNA for TU12B1-TY, complete cds. 
PLACE601 1 9700//Homo sapiens mRNA for ABP32, complete cds. 
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PLACE601 20280//SER/THR-RICH PROTEIN T10 IN DGCR REGION. 

PLACE60132200//TRICHOHYALIN. 

PLACE601 81 870//Pentaxin family 

PROST20084470//Plasmodlum berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds. 
5 PROST20087240//gag gene protein p24 (core nucleocapsid protein) 
PROST20122490//Gallus gallus syndesmos mRNA, complete cds. 
PROST20130320//S-100/ICaBP type calcium binding domain 
PROST20152870//Homo sapiens APC2 gene, exon 14. 

PUAEN10001640//Mus musculus cerebellar postnatal development protein-1 (Cpd1) mRNA, partial cds. 
10 PUAEN20000800//Bleomycin resistance protein 

SMINT20012220//Collagen triple helix repeat (20 copies) 

SMINT20035510//Drosophila melanogaster La related protein (larp) mRNA, partial cds. 
SMINT20036440//Drosophila melanogaster epsin-like protein mRNA, complete cds. 
SMINT20038660//Homo sapiens HNOEL-iso (HNOEL-iso) mRNA, complete cds. 
15 SMINT20043390//Ras association (RalGDS/AF-6) domain 
SMINT20048720//Cytochrome P450// Cytochrome P450 

SMINT20052130//Rattus norvegicus mRNA for gankyrin homologue, complete cds. 

SMINT20054050//ABC1 PROTEIN HOMOLOG PRECURSOR. 

SPLEN20024770//Rattus norvegicus (rsec6) mRNA, complete cds. 
20 SPLEN20040780//CORNIFIN B (SMALL PROLINE-RICH PROTEIN 1 B) (SPR1 B) (SPR1 B). 

SPLEN20041 81 0//BC-2 protein [Homo sapiens] 

SPLEN20100040//258.1 KDA PROTEIN C210RF5 (KIAA0933). 

SPLEN20104150//Ribosomal protein L36 

SPLEN20116720//Homo sapiens misato mRNA, partial cds. 
25 SPLEN201 761 30//Homo sapiens mRNA for ALEX1 , complete cds. 

SPLEN201 8 1 570//TRICHOHYALIN . 

TEST1 2000431 0//TRICHOHYALIN. 

TESTI20016970//TPR Domain 

TESTI20030440//TRICHOHYALIN. 
30 TESTI200431 80//mouse mRNA for megakaryocyte potentiating factor, complete cds. 

TESTI20043910//IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmod- 
ulin-binding motif// IQ calmodulin-binding motif 

TESTI20044900//Strongylocentrotus purpuratus radial spokehead mRNA, complete cds. 
TESTI20046110//Extracellular link domain 
35 TESTI20047930//Homo sapiens NY-REN-2 antigen mRNA, complete cds. 
TESTI20049410//Proprotein convertase P-domain 
TESTI20053950//IQ calmodulin-binding motif 

TESTI20054700//Streptococcus pneumoniae strain g375 surface protein PspC (pspC) gene, pspC-8. 1 allele, complete 
cds. 

to TESTI20055880//Serum amyloid A protein 

TESTI20056030//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds. 
TESTI20061090//Keratin, high sulfur B2 protein 

TESTI20064370//TPR Domain// TPR Domain// TPR Domain// TPR Domain// Synaptobrevin 
TESTI20084250//OXYSTEROL-BINDING PROTEIN. 
45 TESTI200921 70//ENV polyprotein (coat polyprotein) 
TESTI20116050//UBX domain 
TESTI20120500//Kelch motif// Kelch motif 

TESTI20126280//Mus musculus STAP mRNA for sperm tail associated protein, complete cds. 

TESTI20144390//TESTIS-SPECIFIC PROTEIN PBS13. 
50 TESTI201 65990//Ribosomal protein L36 

TESTI201 69500//HYPOTHETICAL 51.9 KDA PROTEIN C27F1.04C IN CHROMOSOME I. 

TESTI20170280//Flagellar L-ring protein 

TESTI20176450//thioredoxin interacting factor [Mus musculus]. 

TESTI20179230//Dihydropyridine sensitive L-type calcium channel (Beta subunit) 
55 TESTI201 80600//Homo sapiens HOM-TES-85 tumor antigen mRNA, complete cds. 

TESTI20209050//HYPOTHETICAL 113.1 KDA PROTEIN IN PRE5-FET4 INTERGENIC REGION. 

TESTI20210570//RETINAL-BINDING PROTEIN (RALBP). 

TESTI20215310//Homo sapiens calcyclin binding protein mRNA, complete cds. 
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TESTI20247440//Human BLu protein testis isoform (BLu) mRNA, complete cds. 
TESTI20249360//Homo sapiens DEME-6 mRNA, partial cds. 
TESTI20250220//TRICHOHYALIN. 

TESTI20251440//Rattus norvegicus (rsec6) mRNA, complete cds. 
s TESTI20255460//Mus musculus mRNA for MIWI (piwi), complete cds. 
THYMU20009500//TPR Domain 

THYMU20010180//MOB1 PROTEIN (MPS1 BINDER 1). 

THYMU2001381 0//Human SEC7 homolog Tic (TIC) mRNA, complete cds. 

THYMU20018250//TPR Domain 
10 THYMU20026950//Mus musculus ROSA 26 transcription AS ROSA26AS mRNA, complete cds. 

THYMU20028410//Mus musculus Pax transcription activation domain interacting protein PTIP mRNA, complete cds. 

THYMU20030460//Homo sapiens tumor endothelial marker 7 precursor (TEM7) mRNA, complete cds. 

THYMU20031330//Homo sapiens putative nucleotide binding protein mRNA, complete cds. 

THYMU20052460//PHORBOLIN I (FRAGMENTS). 
15 THYMU20055450//Zona pellucida-like domain 

THYMU20083830//Homo sapiens angiostatin binding protein 1 mRNA, complete cds. 

THYMU20139160//Uncharacterized protein family UPF0031 

THYMU20151610//Homo sapiens antigen NY-CO-1 (NY-CO-1) mRNA, complete cds. 
TRACH20093400//TRICHOHYALIN. 
20 TRACH201 0451 0//Un characterized protein family UPF0005 
TRACH20122980//HYPOTH ETICAL PROTEIN MJ0798. 
TRACH20139280//PX domain 

TRACH20164820//D-isomer specific 2-hydroxyacid dehydrogenases 

TRACH20165540//Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta Ol type II deletion. 
25 UTERU20051790//guanylate kinase-interacting protein 1 Maguin-1 , membrane-associated - rat 
UTERU20083020//Domain of unknown function DUF71 
UTERU201 21 1 40//Rhodanese-like domain 

UTERU201 28560/1 26.4 KDA PROTEIN IN RUVC-ASPS INTERGENIC REGION. 
UTERU20132620//AXONEME-ASSOCIATED PROTEIN MST101(2). 
30 UTERU20134830//pellino (Drosophila) homolog 2 [Homo sapiens] 
UTERU20181270//Zinc knuckle 

[0266] With respect to the remaining 613 clones, there are so far no information available for estimating their func- 
tions. However, there is the possibility that the functions of these clones will be revealed in future. Their Clone Names 
are indicated below. 

35 ADRGL20027530, ADRGL20040310, AD RGL20 040770, ADRGL20046760, ADRGL20047080, ADRGL20057560, 
ADRGL20067320, ADRGL20095330, ASTRO20003720, ASTRO20004820, ASTRO20012270, AST RO2002 0350, 
ASTRO20022020, ASTRO20027330, ASTRO20047510, ASTRO20069200, ASTRO20076660, AST RO2009 1770, 
ASTRO20141740, BNGH41 0000570, BNGH420014060, BNGH420040760, BNGH420042910, BNGH420045380, 
BNGH420061350, BNGH420 062340, BNGH420085100, BRACE20009050, BRACE2001 7790, BRACE2001 8810, 

40 BRACE20025820, BRACE20038920, BRACE20054480, BRACE20057870, BRACE20059110, BRACE20062580, 
BRACE20069440, BRACE20098860, BRACE201 96960, BRACE20200970, BRACE20205840, BRACE20207420, 
BRACE20212450, BRACE202 16700, BRACE20219360, BRAMY1 0000980, BRAMY2000021 0, BRAMY20000250, 
BRAMY20020440, B RAM Y2002 1580, BRAMY20023390, BRAMY20036530, BRAMY20036810, BRAMY20039290, 
BRAMY20043520, BRAMY20050640, BRAMY20052440, BRAMY20073080, BRAMY20074110, BRAMY20074860, 

45 BRAMY20076130, BRAMY20076530, BRAMY20095080, BRAMY20095570, BRAMY201 00680, BRAMY201 07980, 
BRAMY20120170, B RAM Y20 124970, BRAMY20125170, BRAMY201 26910, BRAMY201 39750, BRAMY20 155500, 
BRAMY201 59250, B RAM Y20 160020, BRAMY201 73480, BRAMY202 19620, BRAMY20225250, BRAMY20227230, 
BRAMY20227960, BRAMY20243120, BRAMY20245350, BRAMY20267780, BRAMY20269040, BRAMY20271140, 
BRAMY20287400, BRAWH20020600, BRAWH20025490, BRAWH20027250, BRAWH20055240, BRAWH20055780, 

50 BRAWH20058120, BRAWH20078080, BRAWH20082550, BRAWH201 73790, BRAWH20 175230, BRAWH2017534o| 
BRAWH201 82670, BRAWH20 186010, BRCOC1 0000400, BRHIP20003590, BRHIP20005060, BRSSN20092440, 
CTONG1 0000090, CTONG20000340, CTONG20002790, CTONG20008460, CTONG200 15240, CTONG20020660, 
CTONG20027660, CTONG20031150, CTONG20031890, CTONG20033500, CTONG20035240, CTONG20036800, 
CTONG20039370, CTONG20050490, CTONG20055670, CTONG20057750, CTONG20057950, CTONG20061 290, 

55 CTONG20062730, CTONG20065240, CTONG20073990, CTONG20074740, CTONG200 76230, CTONG20081840, 
CTONG201 33720, CTONG20 165590, CTONG201 66580, CTONG20 168460, CTONG20 169530, CTONG20 174440, 
CTONG201 79390, CTONG 20 179980, CTONG201 80620, CTONG20181350, CTONG201 84130, CTONG2018614o! 
CTONG201 90630, DFNES20032550, DFNES20088810, FCBBF1 0002200, FCBBF20021110, FCBBF20028980, 
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FCBBF20038230, FCBBF20038950, FCBBF20061310, FCBBF20070800, FCBBF30000010, FCBBF30001020, 
FCBBF30001150, FCBBF30002330, FCBBF30004730, FCBBF30005180, FCBBF300 19240, FCBBF30056980, 
FCBBF30063990, FCBBF30068210, FCBBF30072480, FCBBF30074530, FCBBF30074620, FCBBF30081000, 
FCBBF30088700, FCBBF30089380, FCBBF30091010, FCBBF30099490, FCBBF30101240, FCBBF30101300, 

5 FCBBF301 05080, FCBBF301 06950, FCBBF301 07330, FCBBF30114180, FCBBF301 15230, FCBBF30 128420, 
FCBBF301 30580, FCBBF301511 90, FCBBF301 70710, FCBBF301 79180, FCBBF30181730, FCBBF30 194370, 
FCBBF301 95700, FCBBF40001920, FCBBF40005000, FCBBF5000041 0, FEBRA20035200, FEBRA20039070, 
FEBRA20040260, FEBRA20040290, FEBRA20076200, FEBRA20078180, FEBRA2 0082660, FEBRA20083410, 
FEBRA20086600, FEB RA2009 1620, FEBRA20093270, FEBRA20093280, FEBRA2009541 0, FEBRA20098040, 

10 FEBRA20101410, FEBRA20 108020, FEBRA201 08580, FEBRA20121200, FEBRA20 163980, FEBRA201 75020, 
FEBRA201 80510, FEBRA201 87460, HHDPC20082970, HLUNG20009260, HLUNG20009550, HLUNG20010130, 
HLUNG20011440, HLUNG20012140, HLUNG20020500, HLUNG20021450, HLUNG20023030, HLUNG20025620, 
HLUNG20029420, HLUNG20030610, HLUNG20031620, HLUNG20033310, HLUNG20037140, HLUNG20037780, 
HLUNG20038330, HLUNG20047070, HLUNG20055240, HLUNG20056560, HLUNG20057380, HLUNG20059240, 

15 HLUNG20065700, HLUNG20067810, HLUNG20072190, HLUNG20072450, HLUNG20079310, HLUNG20083840, 
HLUNG20083960, HLUNG20093030, HLUNG20094130, KIDNE20011 600, KIDNE20024380, KIDNE20086970, 
KIDNE20091090, KIDNE20094260, KIDNE20095530, KIDNE201 37310, KIDNE201 38450, KIDNE20141120, 
KIDNE201 42680, KIDNE20 154830, KIDNE201 55980, KIDNE20157100, KIDNE201 76030, KIDNE20181 670, 
KIDNE20191870, LIVER20007690, LIVER20032340, MESAN20008940, MESAN20021860, MESAN20029780, 

20 MESAN20030390, MESAN20041380, MESAN20045750, MESAN20060220, MESAN20085360, MESAN20089260, 
MESAN20094180, NESOP20005040, NT2NE20028700, NT2NE20033150, NT2NE20045190, NT2NE20047870, 
NT2NE20062880, NT2NE20066590, NT2NE20070520, NT2NE20073650, NT2NE20077860, NT2NE20086070, 
NT2NE20088030, NT2NE20 104000, NT2NE201 07810, NT2NE201 12210, NT2NE201 14850, NT2NE201 23610, 
NT2NE201 24570, NT2NE201 26030, NT2NE201 40280, NT2NE20 148690, NT2NE20 149500, NT2NE201 50610, 

25 NT2NE20157120, NT2NE20165190, NT2NE20181760, NT2NE201 84720, NT2RP70022820, NT2RP70049610, 
NT2RP70056290, NT2RP70074800, NT2RP70080150, NT2RP 70084540, NTONG200 17620, NTONG20049180, 
OCBBF20001780, OCBBF20009820, OCBBF201 09450, OCBBF201 12320, OCBBF201 15360, OCBBF2011 7220, 
OCBBF201 19810, OCBBF201 42290, OCBBF201 52330, OCBBF201 88280, PLACE60001910, PLACE60011180, 
PLACE60017120, PLACE60055350, PLACE60062870, PLACE60063940, PLACE60069880, PLACE60072390, 

30 PLACE60072420, PLACE60077870, PLACE60081260, PLACE60088240, PLACE60092280, PLACE60092370, 
PLACE60095240, PLACE601 09910, PLACE 60 132320, PLACE 60 132880, PLACE601 55910, PLACE601 57310, 
PLACE60162100, P LAC E601 75640, PLACE601 77910, PROST1 0001 360, PROST1 0002150, PROST20011800, 
PROST20014140, PROST20014650, PROST20015400, PROST20022120, PROST20036280, PROST20041460, 
PROST20042700, PROST20047440, PROST20048770, PROST20052720, PROST20054660, PROST20060200, 

35 PROST20078710, PROST20 093470, PROST20094000, PROST2009731 0, PROST20097360, PROST201 02500, 
PROST201 03820, PROST20121570, PROST20 124000, PROST20 125420, PROST201 38730, PROST201 56360, 
PROST201 59320, SKMUS20026340, SKMUS20064810, SKNSH 10001 010, SKNSH20007160, SKNSH20040390, 
SKNSH20068220, SMINT20011830, SMINT20013970, SMINT2001461 0, SMINT20017310, SMINT20021260, 
SMINT20023110, SMINT20031280, SMINT20045830, SMINT20045890, SMINT20047290, SMINT20056240, 

40 SMINT20067080, SMINT20070620, SMINT20077920, SMINT2008491 0, SMINT20085310, SMINT20085450, 
SMINT20089220, SMINT20092120, SMINT20093630, SMINT20094150, SPLEN20005160, SPLEN20005370, 
SPLEN20012450, SPLEN20015030, SPLEN20016500, SPLEN2001 9120, SPLEN20020530, SPLEN20023430, 
SPLEN20024510, SPLEN200291 70, SPLEN20036780, SPLEN20043430, SPLEN20043460, SPLEN20045550, 
SPLEN20051420, SPLEN20062830, SPLEN20067010, SPLEN20076190, SPLEN20081640, SPLEN20087370, 

45 SPLEN20087860, SPLEN201 08460, SPLEN20110210, SPLEN20111450, SPLEN20114190, SPLEN20117580, 
SPLEN20126110, SPLEN201 37530, SPLEN201 92570, SPLEN201 93750, SPLEN201 97090, SPLEN201 97740, 
SPLEN201 99850, SPLEN20200070, SPLEN20200340, SPLEN20203590, SPLEN20205120, TESOP1 0001600, 
TESTI1 0000850, TESTI20005980, TESTI2001 2360, TESTI2001 9590, TESTI20028020, TESTI2003061 0, 
TESTI20034750, TESTI20035330, TESTI20040850, TESTI20045740, TESTI20049990, TESTI200501 70, 

50 TESTI20052670, TESTI20053800, TESTI20059330, TESTI20059370, TESTI20059790, TESTI20060150, 
TESTI20060450, TESTI200621 80, TESTI20062580, TESTI20064990, TESTI200661 70, TESTI20066280, 
TESTI20066590, TESTI20067350, TESTI20068940, TESTI20076920, TESTI20079060, TESTI20080460, 
TESTI20083890, TESTI20085670. TESTI20089290, TESTI200901 80, TESTI20090970, TESTI20091360, 
TESTI20093900, TESTI20094620, TESTI20097270, TESTI201 07340, TESTI201 1 31 50, TESTI201 1 7500, 

55 TESTI201 18460, TESTI201 22440, TESTI20 124440, TESTI20 125440, TESTI201 32680, TESTI201 34010, 
TESTI201 34270, TESTI201 42480, TESTI201 51 050, TESTI201 52490, TESTI201 59380, TESTI201 61 01 0, 
TESTI201 65680, TESTI201 67580, TESTI201 70690, TESTI201 70890, TESTI201 7311 0, TESTI201 7951 0, 
TESTI201 8221 0, TESTI201 84280, TESTI201 84820, TESTI201 92570, TESTI201 93520, TEST1201 97290, 
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TESTI201 98600, TESTI201 99980, TESTI202001 20, TESTI20200840, TESTI202051 50, TESTI2021 1 380, 
TESTI2021 9390, TESTI20221 790, TESTI20223380, TESTI20248850, TESTI20254090, TESTI20254480, 
TESTI20259200, TESTI202601 40, TESTI202651 50, TESTI20265340, TESTI20265890, TESTI20269250, 
TESTI20269360, TESTI20272380, TESTI202 77300, TESTI20287760, THYMU20007750, THYMU20008000, 

5 THYMU20009460, THYMU20009710, THYMU20010710, THYMU20012560, THYMU20014430, THYMU2001 8390, 
THYMU20019000, THYMU20020370, THYMU20021540, THYMU20029830, THYMU20036500, THYMU20043440, 
THYMU20043560, THYMU20044100, THYMU20044520, THYMU20051340, THYMU20058550, THYMU20060480, 
THYMU20062520, THYMU20064680, THYMU20069130, THYMU20069460, THYMU20069650, THYMU20071460, 
THYMU20072580, THYMU20073070, THYMU20073080, THYMU20078020, THYMU20080490, THYMU20083500, 

10 THYMU20084520, THYMU20086430, THYMU20089170, THYMU20089900, THYMU20091040, THYMU201 12570, 
THYMU201 15730, THYMU201 17850, THYMU201 28910, THYMU201 29020, THYMU201 30470, THYMU201 34260, 
THYMU201 40510, THYMU2014801 0, THYMU20 149230, THYMU201 57620, THYMU201 74490, THYMU201 74790, 
THYMU201 75260, THYMU201 77070, THYMU20181890, THYMU20 187210, TKIDN1 0001 710, TRACH200 12490, 
TRACH20021000, TRACH20025370, TRACH20026640, TRACH20041 090, TRACH20044990, TRACH20049500, 

15 TRACH20051590, TRACH20057200, TRACH20080810, TRACH20093480, TRACH201 01590, TRACH20 123870, 
TRACH201 24970, TRACH201 25620, TRACH20129180, TRACH201401 80, TRACH201 58240, TRACH201 60800, 
TRACH201 74980, TRACH201 82780, TRACH20185120, UTERU1 0001 870, UTERU20000230, UTERU20011760, 
UTERU20013890, UTERU20027360, UTERU20029930, UTERU20031350, UTERU20040370, UTERU20040390, 
UTERU20040730, UTERU20041 970, UTERU20065470, UTERU20079240, UTERU20090940, UTERU20091470, 

20 UTERU201 02260, UTERU20 103040, UTERU201 06510, UTERU201 40010, UTERU201 67570, UTERU201 73030, 
UTERU201 76230 



EXAMPLE 7 



25 Expression frequency analysis in silico 

[0267] The cDNA libraries derived from various tissues and cells as indicated in Example 1 were prepared, and 
cDNA clones were selected from each library at random. The 5'-end sequences were determined and the database 
was constructed based on the data. The database was constructed based on the nucleotide sequences of 770,546 

30 clones, and thus the population of the database is large enough for the analysis. 

[0268] Then, clones having a homologous sequence are categorized into a single cluster (clustering) by searching 
the nucleotide sequences of respective clones in this database with the program of nucleotide sequence homology 
search; the number of clones belonging to each cluster was determined and normalized for every library; thus, the 
ratio of a certain gene in each cDNA library was determined. This analysis gave the information of the expression 

35 frequency of genes in tissues and cells which were sources of the cDNA libraries. 

[0269] Then, in order to analyze the expression of a gene containing the nucleotide sequence of the cDNA of the 
present invention in tissues and cells, the library derived from a tissue or a cell used in the large-scale cDNA analysis 
was subjected to the comparison of the expression levels between tissues or cells. Namely, the expression frequency 
was analyzed by comparing the previously normalized values between tissues and/or cells for which the nucleotide 

40 sequences of 600 or more cDNA clones had been analyzed. By this analysis, some of the genes were revealed to be 
involved in the pathology and functions indicated below. Each value in Tables 3 to 39 shown below represents a relative 
expression frequency; the higher the value, the higher the expression level. 



Osteoporosis-related genes 

45 

[0270] Osteoporosis is a pathology in which bones are easily broken owing to overall decrease in components of 
bone. The onset involves the balance between the functions of osteoblast producing bone and osteoclast absorbing 
bone, namely bone metabolism. Thus, the genes involved in the increase of osteoclasts differentiating from precursor 
cells of monocyte/macrophage line (Molecular Medicine 38. 642-648. (2001)) are genes involved in osteoporosis rel- 

50 evant to bone metabolism. 

[0271] A nucleotide sequence information -based analysis was carried out to identify the genes whose expression 
frequencies are higher or lower in CD34+ cell (cell expressing a glycoprotein CD34) treated with the osteoclast differ- 
entiation factor (Molecular Medicine 38. 642-648. (2001)) than in the untreated CD34+ cell, which is the precursor cell 
of monocyte/macrophage line. The result of comparative analysis for the frequency between the two cDNA libraries 

55 prepared from the RNA of CD34+ cells (CD34C) and from the RNA of CD34+ cells treated with the osteoclast differ- 
entiation factor (D30ST, D60ST or D90ST) showed that the genes whose expression levels were different between the 
two were the following clones (Table 3). 

ASTRO20010290, BRAMY20036530, BRAMY20043630, BRAMY20089770, BRAMY20 190550, CD34C20001750, 
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FCBBF20066340, FEBRA20040290, HLUNG20015180, HLUNG20041590, HLUNG20052300, KIDNE20084040, 
MESAN20021860, MESAN20027240, NTONG20055200, PROST20016760, PUAEN1 0001 640, SMINT2000602o! 
SMINT20028840, SMINT20035050, SPLEN201 81570, TESTI20064530, TESTI2021 0030, THYMU20029830, 
THYMU20139160, TRACH20051590 
5 [0272] These genes are involved in osteoporosis. 

Genes involved in neural cell differentiation 

[0273] Genes involved in neural cell differentiation are useful for treating neurological diseases. Genes with varying 
10 expression levels in response to induction of cellular differentiation in neural cells are thought to be involved in neuro- 
logical diseases. 

[0274] A survey was performed for genes whose expression levels are varied in response to induction of differenti- 
ation (stimulation by retinoic acid (RA) or growth inhibitor treatment after RA stimulation) in cultured cells of a neural 
strain, NT2, The result of comparative analysis of cDNA libraries derived from undifferentiated NT2 cells (NT2RM) and 
15 the cells subjected to the differentiation treatment (NT2RP, NT2RI or NT2NE) showed that the genes whose expression 
levels were different between the two were the following clones (Table 4). 

ADRGL20023920, ASTR020009140, BNGH420077980, BNGH420086030, BRACE20062580, BRACE20079370, 
BRACE20215410, BRAMY20003540, BRAMY20043630, BRAMY20076130, BRAMY20095080, BRAMY20227860, 
BRAWH20082550, BRHIP1 0001 040, BRSSN20005610, CTONG20027660, CTONG20044230, CTONG20066110, 

20 CTONG20079590, CTONG20084660, CTONG201 33720, CTONG20 165750, CTONG201 88080, FCBBF20023490, 
FCBBF20033360, FCBBF20059660, FCBBF20070950, FCBBF30004340, FCBBF30095410, FCBBF301 25460, 
FCBBF30179180, FCBBF30236670, FCBBF30257370, FCBBF50000610, FCBBF50001650, FEBRA20038330, 
FEBRA20039260, FEBRA20063720, FEBRA20090220, FEBRA20 150420, HEART1 0001 490, HLUNG20032460, 
HLUNG20041590, KIDNE20089870, MESAN2001 6270, MES AN 2002 1860, MESAN20060430, MESAN20067430, 

25 NT2NE20018740, NT2NE20018890, NT2NE20021 860, NT2NE20026200, NT2NE20026510, NT2NE20028700, 
NT2NE20033150, NT2NE20037050, NT2NE20038870, NT2NE20039210, NT2NE20042550, NT2NE20045190, 
NT2NE20047870, NT2NE20053230, NT2NE20053950, NT2NE20059210, NT2NE20059680, NT2NE20060750, 
NT2NE20061030, NT2NE20062880, NT2NE20064780, NT2NE20066590, NT2NE20069580, NT2NE20070520, 
NT2NE20073650, NT2NE20077250, NT2NE20077270, NT2NE20077860, NT2NE20079670, NT2NE20080770, 

30 NT2NE20082130, NT2NE20082600, NT2NE20086070, NT2NE20087270, NT2NE20087850, NT2NE20088030, 
NT2NE20092950, NT2NE20095230, NT2NE201 04000, NT2NE201 0781 0, NT2NE201 08420, NT2NE20111190, 
NT2NE20112210, NT2NE201 14850, NT2NE20117580, NT2NE20119980, NT2NE20 123610, NT2NE20 124570, 
NT2NE201 26030, NT2NE201 27900, NT2NE20140130, NT2NE201 40280, NT2NE20141040, NT2NE20 145250, 
NT2NE201 46510, NT2NE20 148690, NT2NE20 149500, NT2NE20 150610, NT2NE20 152620, NT2NE20 153620, 

35 NT2NE201 55650, NT2NE20157120, NT2NE201 65190, NT2NE201 67660, NT2NE20 173970, NT2NE201 77210, 
NT2NE20181760, NT2NE201 81 800, NT2NE201 84720, NT2RI20016240, NT2RI20021200, NT2RI20033920, 
NT2RI20093010, NT2RP70001120, NT2RP70001730, NT2RP70003110, NT2RP70012830, NT2RP70022820, 
NT2RP70027790, NT2RP70029780, NT2RP70030840, NT2RP70031070, NT2RP70031340, NT2RP70031480, 
NT2RP70035110, NT2RP70046410, NT2RP70049610, NT2RP70056290, NT2RP70056690, NT2RP70057500, 

40 NT2RP70064570, NT2RP70074800, NT2RP70075300, NT2RP70075800, NT2RP70080150, NT2RP70084540, 
NT2RP70087140, NT2RP70090870, OCBBF20001780, OCBBF20009820, OCBBF20 142290, OCBBF201 55030, 
OCBBF201 75360, OCBBF201 77540, OCBBF201 77910, PLACE60054820, PLACE60061370, PLACE60073090, 
PLACE60162100, PROST20011800, PROST20045700, PROST20078710, PROST20094000, PUAEN1 0000650, 
PUAEN1 0001 640, SKNMC20006350, SMINT20016150, SMINT20030740, SMINT20035510, SMI NT2003 9050, 

45 SMINT20047290, SPLEN20063250, SPLEN201 17580, SPLEN201 25230, TESTI20030610, TESTI20043910, 
TESTI20066280, TESTI20067480, TESTI20105130, TESTI201 061 70, TESTI20143180, TESTI20221790, 
TESTI20254090, TESTI20274960, THYMU 10004280, THYMU20007020, THYMU20 104480, THYMU20139160, 
TRACH20026640, UTERU1 0001 060, UTERU20026620, UTERU20079240, UTERU20083020, UTERU201 02260, 
UTERU201 32620 

50 [0275] These genes are neurological disease-related genes. 

Cancer-related genes 

[0276] It has been assumed that, distinct from normal tissues, cancer tissues express a distinct set of genes, and 
55 thus the expression can contribute to the carcinogenesis in tissues and cells. Thus, the genes whose expression 
patterns in cancer tissues are different from those in normal tissues are cancer-related genes. Search was carried out 
for the genes whose expression levels in cancer tissues were different from those in normal tissues. 
[0277] The result of comparative analysis of cDNA libraries derived from breast tumor (TBAES) and normal breast 
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(BEAST) showed that the genes whose expression levels were different between the two were the following clones 
(Table 5). 

CTONG20070780, CTONG20084660, HLUNG20045340, TESTI20047370 

[0278] The result of comparative analysis of cDNA libraries derived cervical tumor (TCERX) and normal cervical duct 
5 (CERVX) showed that the genes whose expression levels were different between the two were the following clones 
(Tables). 
SMINT20030740 

[0279] The result of comparative analysis of cDNA libraries derived from colon tumor (TCOLN) and normal colon 
(COLON) showed that the genes whose expression levels were different between the two were the following clones 
10 (Table 7). 

UTERU2004037 

[0280] The result of comparative analysis of cDNA libraries derived from esophageal tumor (TESOP) and normal 
esophagus (N ESOP) showed that the genes whose expression levels were different between the two were the following 
clones (Table 8). 

15 HLUNG20015180, NESOP20004520, NESOP20005040, TESOP1 0000350, TESOP1 0001 600, THYMU20071120 
[0281] The result of comparative analysis of cDNA libraries derived from kidney tumor (TKIDN) and normal kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were the following clones 
(Table 9). 

ASTRO20009140, ASTRO20027330, ASTRO20055930, BGGI1 2001 0750, BNGH420074600, BRACE20050870, 

20 BRACE20054480, BRACE20062580, BRACE20219360, BRAMY20003540, BRAMY20003880, BRAMY20043630, 
BRAMY20055760, B RAM Y20 125360, B RAM Y201 90550, BRAMY20204270, BRAMY20227860, BRAWH2001 4590, 
BRAWH20093070, BRHIP10001 040, CTONG20033750, CTONG20039370, CTONG20045500, CTONG20079590, 
FCBBF20023490, FCBBF30004340, FCBBF301 06950, FCBBF301 15230, FCBBF301 69280, FCBBF30225930, 
FCBBF30282020, FEBRA20038330, FEBRA20039260, FEBRA20040290, FEBRA20082660, FEBRA20121200, 

25 FEBRA201 70240, HEART1 0001 490, HLUNG20041590, HLUNG2006B120, HLUNG20072450, HLUNG20083480, 
HLUNG20083960, KIDNE20011600, KIDNE20016360, KIDNE20024380, KIDNE20027980, KIDNE20080690, 
KIDNE20081170, KIDNE20083150, KIDNE20083620, KIDNE20084030, KIDNE20084040, KIDNE20084730, 
KIDNE20084800, KIDNE20086490, KIDNE20086660, KIDNE20086970, KIDNE20087880, KIDNE20088240, 
KIDNE20089870, KIDNE20091090, KIDNE20094260, KIDNE20094670, KIDNE20095530, KIDNE201 33460, 

30 KIDNE20133880, KIDNE201 34130, KIDNE201 34890, KIDNE20137310, KIDNE20 138450, KIDNE201 40870, 
KIDNE20141120, KIDNE20141700, KIDNE20142680, KIDNE20142680, KIDNE201 42900, KIDNE201 43200, 
KIDNE20147170, KIDNE201 48080, KIDNE201 49780, KIDNE20150730, KIDNE201 52440, KIDNE201 54330, 
KIDNE201 54830, KIDNE201 55980, KIDNE20157100, KIDNE201 60360, KIDNE201 60960, KIDNE201 63710, 
KIDNE201 65390, KIDNE20169180, KIDNE201 70400, KIDNE20173150, KIDNE201 73430, KID NE201 76030, 

35 KIDNE20181670, KIDNE201 82540, KIDNE20186170, KIDNE20188630, KIDNE201 89890, KIDNE201 89960, 
KIDNE20191870, MESAN20038520, MESAN20041380, OCBBF2001 6390, OCBBF20 142290, OCBBF201 74890, 
PLACE60061370, PLACE60073090, PLACE60181870, PROST200 16760, PUAEN 10000650, SMINT20039050, 
SMINT20089210, SPLEN20017610, SPLEN20024930, SPLEN20057830, SPLEN20063250, SPLEN20126110, 
SPLEN201 35030, SPLEN201 36700, TESTI20070740, TESTI20262150, THYMU20009500, THYMU2001 9260, 

40 THYMU201 57620, TKIDN1 0000620, TKIDN1 0001 71 0, TKIDN1 0001 920, TRACH20011 01 0, UMVEN1 0001 380 

[0282] The result of comparative analysis of cDNA libraries derived from liver tumor (TLIVE) and normal liver (LIVER) 
showed that the genes whose expression levels were different between the two were the following clones (Table 10). 
CTONG20069320, FCBBF30236670, FEBRA20038220, FEBRA20039260, KIDNE20087880, LIVER20006260, 
LIVER20007690, LIVER20007750, LIVER2001 051 0, LI VER2001 0760, LIVER2001 0990, LIVER2001 1 640, 

45 LIVER20013890, LIVER20026440, LIVER20030650, LIVER20032340, LIVER20038000, LIVER20040740, 
LIVER20055270, MESAN20027240, NT2RI20021200, SKMUS20006790, TESTI 20035330, THYMU 10004280, 
THYMU20029830 

[0283] The result of comparative analysis of cDNA libraries derived from lung tumor (TLUNG) and normal lung 
(HLUNG) showed that the genes whose expression levels were different between the two were the following clones 
50 (Table 11). 

HLUNG20052300, SMINT20035050, HLUNG20041590, PROST2001 6760, BRAMY20043630, HLUNG2001 5180, 
THYMU20139160, HLUNG20020850, HLUNG20032460, BRAMY20204270, BRAMY20001510, B RAM Y2022 7860, 
CTONG20029030, CTONG201 68460, CTONG201 86290, FEBRA20039260, FEBRA20078800, FEBRA201 63980, 
HCHON20000870, HLUNG20008460, HLUNG20009260, HLUNG20009550, HLUNG20010130, HLUNG20011260, 
55 HLUNG20011440, HLUNG20011460, HLUNG20012140, HLUNG20014590, HLUNG200 15070, HLUNG20020500, 
HLUNG20021450, HLUNG20023030, HLUNG20024050, HLUNG20025620, HLUNG20028110, HLUNG20029420, 
HLUNG20029490, HLUNG20030420, HLUNG20030490, HLUNG20030610, HLUNG20031620, HLUNG20033060, 
HLUNG20033310, HLUNG20033350, HLUNG20034970, HLUNG20037140, HLUNG20037160, HLUNG20037780, 
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HLUNG20038330, HLUNG20041540, HLUNG20042730,.HLUNG20045340, HLUNG20047070, HLUNG20050760, 
HLUNG20051330, HLUNG20054790, HLUNG20055240, HLUNG20056560, HLUNG20057380, HLUNG20059240, 
HLUNG20060670, HLUNG20063700, HLUNG20065700, HLUNG20065990, HLUNG20067810, HLUNG20068120, 
HLUNG20069350, HLUNG20070410, HLUNG20072100, HLUNG20072190, HLUNG20072450, HLUNG20074330, 

5 HLUNG20079310, HLUNG20081390, HLUNG20081530, HLUNG20082350, HLUNG20083330, HLUNG20083480, 
HLUNG20083840, HLUNG20083960, HLUNG20084790, HLUNG20085210, HLUNG20088750, HLUNG20092530, 
HLUNG20093030, HLUNG20094130, KIDNE20142900, PROST20052850, SKNMC20006350, SPLEN20012450, 
TESTI20057590, TESTI20061200, TESTI20067480, TESTI201 16050, THYMU1 0004280, THYMU20010180, 
TRACH20011010, UTERU20016580, UTERU201 27030 

10 [0284] The result of comparative analysis of cDNA libraries derived from ovary tumor (TOVER) and normal ovary 
(NOVER) showed the genes whose expression levels were different between the two were the following clones (Table 
12). 

KIDNE20089870, NT2RP 70075300, TESTI20132310 

[0285] The result of comparative analysis of cDNA libraries derived from stomach tumor (TSTOM) and normal stom- 
as ach (STOMA) showed that the genes whose expression levels were different between the two were the following clones 
(Table 13). 

BNGH420087430, BRAMY20227860, BRAWH20027250, CTONG201 74440, FEBRA20090220, PUAEN1 0000650, 
SMINT20023110, SMINT20030740, SMINT20045890, SPLEN20048800, SPLEN20139360, TESTI2006341 0, 
TESTI201 50920, TR ACH 20026640, UTERU20041970 
20 [0286] The result of comparative analysis of cDNA libraries derived from uterine tumor (TUTER) and normal uterus 
(UTERU) showed that the genes whose expression levels were different between the two were the following clones 
(Table 14). 

ADRGL20020290, BRACE20038920, BRAMY20091230, BRAMY20093490, BRAMY20227860, BRHIP20005060, 
CTONG20069320, CTONG20083430, FCBBF30005360, FCBBF30257370, FEBRA20038330, FEBRA20039260, 

25 FEBRA20040260, FEBRA20078180, FEBRA20087550, HLUNG20015070, HLUNG20015180, MESAN20007110, 
MESAN20067430, MESAN20095800, NT2RP70057500, SKMUS20008730, SKNMC20006350, SMINT20035050, 
SMINT20045890, SPLEN20073880, SPLEN20076470, SPLEN201 18050, TESTI20030610, TESTI20035330, 
TESTI20057590, TESTI20059080, TESTI20105130, THYMU 10004280, THYMU20139160, UTERU1 0001 060, 
UTERU1 0001 870, UTERU20000230, UTERU20000950, UTERU20011760, UTERU20013890, UTERU200 16580, 

30 UTERU20026620, UTERU20027360, UTERU20029930, UTERU20031350, UTERU20035770, UTERU20040150, 
UTERU20040370, UTERU20040390, UTERU20040730, UTERU20041630, UTERU20041970, UTERU20045200, 
UTERU20051790, UTERU20064120, UTERU20065470, UTERU20079240, UTERU20083020, UTERU20086530, 
UTERU20087070, UTERU20087850, UTERU20089300, UTERU20089390, UTERU20089620, UTERU20090940, 
UTERU20091470, UTERU20094830, UTERU20095100, UTERU20099040, UTERU20099510, UTERU20101150, 

35 UTERU201 02260, UTERU201 03040, UTERU201 03200, UTERU201 04310, UTERU201 06510, UTERU20121140, 
UTERU201 22520, UTERU201 25810, UTERU201 27030, UTERU20127150, UTERU201 28560, UTERU201 32620, 
UTERU201 34830, UTERU20 139760, UTERU201 40010, UTERU201 67570, UTERU201 68960, UTERU201 69020, 
UTERU201 73030, UTERU201 76230, UTERU20177150, UTERU201 81270, UTERU201 85220, UTERU201 88670, 
UTERU201 88840 

*o [0287] The result of comparative analysis of cDN A libraries derived from tongue cancer (CTONG) and normal tongue 
(NTONG) showed that the genes whose expression levels were different between the two were the following clones 
(Table 15). 

ADRGL20023920, BRACE20038920, BRACE20050870, BRACE20061620, BRAMY20036530, BRAMY20076130, 
BRAMY20204270, BRAMY20267780, BRCAN20001 680, CTONG 10000090, CTONG20000340, CTONG200 02790, 

45 CTONG20004120, CTONG20004520, CTONG20007660, CTONG200081 90, CTONG20008460, CTONG200 15240, 
CTONG20017490, CTONG20020660, CTONG20020950, CTONG20027660, CTONG20029030, CTONG20030280, 
CTONG20031150, CTONG20031890, CTONG20032930, CTONG20033500, CTONG20033610, CTONG20033750, 
CTONG20035240, CTONG20036800, CTONG20036990, CTONG20039370, CTONG20041150, CTONG20041 260, 
CTONG20042640, CTONG20044230, CTONG20044870, CTONG20045500, CTONG20046690, CTONG20049480, 

50 CTONG20050490, CTONG20051100, CTONG20051450, CTONG20052780, CTONG20053990, CTONG20055670, 
CTONG20055850, CTONG20056150, CTONG20057750, CTONG20057950, CTONG20059130, CTONG20060040, 
CTONG20061290, CTONG20062730, CTONG20063770, CTONG20063930, CTONG200 65240, CTONG20065680, 
CTONG20066110, CTONG20068360, CTONG20069320, CTONG20 069420, CTONG20070090, CTONG20070720, 
CTONG20070780, CTONG20070910, CTONG20071 040, CTONG20071680, CTONG200 72930, CTONG20073990, 

55 CTONG20074000, CTONG200741 70, CTONG20074740, CTONG20076230, CTONG2007681 0, CTONG20077760, 
CTONG20078340, CTONG20079590, CTONG20080140, CTONG20081840, CTONG20083430, CTONG20083980, 
CTONG20084020, CTONG20084660, CTONG20085210, CTONG20 133720, CTONG201 65590, CTONG201 65750, 
CTONG201 66580, CTONG20 167750, CTONG201 68240, CTONG201 68460, CTONG201 69040, CTONG201 69530, 
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CTONG201 70940, CTONG201 74290, CTONG201 74580, CTONG20 176040, CTONG201 79390, CTONG20 179890, 
CTONG201 79980, CTONG201 80620, CTONG201 80690, CTONG20181350, CTONG2018343o! CTONG2018383o! 
CTONG20184130, CTONG201 84830, CTONG20186140, CTONG201 86290, CTONG201 8637o| CTONG201 86520, 
CTONG201 86550, CTONG20 188080, CTONG201 89000, CTONG20 190290, CTONG20 190630, FCBBF20070950, 

5 FCBBF30001100. FCBBF301 75350, FCBBF40005000, FEBRA20027070, FEBRA20038330, FEBRA20039260, 
FEBRA20040290, FEBRA20046200, FEBRA20063720, FEBRA20078800, FEBRA20090220, HCHON2000087o! 
HLUNG20068120, MESAN20008150, MESAN20027900, NT2NE201 53620, NT2RP70001730, NT2RP7001283o| 
NT2RP70027790, NT2RP70057500, NT2RP70064570, NT2RP70090870, NTONG20002230, NTONG20005310, 
NTONG20017620, NTONG20029850, NTONG20031580, NTONG20032100, NTONG20034540, NTONG2003515o! 

10 NTONG20043080, NTONG20048440, NTONG20049180, NTONG20053630, NTONG20053730, NTONG20053910, 
NTONG20055200, NTONG20058010, NTONG20058220, OCBBF201 10730, OCBBF201 77540, OCBBF201 77910, 
PROST20016760, PROST20042700, PROST20 050390, PROST20063430, PROST201 30320, PUAEN1 0000650, 
PUAEN 10001 640, PUAEN20003120, SKMUS20006790, SKNMC20006350, SKNSH20007160, SMINT20030740, 
SMINT200355I0, SMINT2008921 0, SPLEN20024930, SPLEN20040780, SPLEN20063250, SPLEN20 181570, 

15 SPLEN201 87490, TESTI20047370, TESTI20057880, TESTI20064530, TESTI20079980, TESTI2010513o! 
TESTI201 18460, TESTI20121040, TESTI201 97290, THYMU1 0004280, THYMU20030460, THYMU20055460, 
THYMU20089900, THYMU20121040, THYMU20139160, THYMU201 45990, TRACH20011010, TRACH20090060, 
UTERU20000230, UTERU20000950, UTERU20016580, UTERU20045200, UTERU20083020 
[0288] These genes are involved in cancers. 

20 [0289] Further, there is a method to search for genes involved in development and differentiation: the expression 
frequency analysis in which the expression levels of genes are compared between developing or differentiating tissues 
and/or cells and adult tissues and/or cells. The genes involved in tissue development and/or differentiation are genes 
participating in tissue construction and expression of function, and thus are useful genes, which are available for re- 
generative medicine aiming at convenient regeneration of injured tissues. 

25 [0290] Search was carried out for the genes whose expression frequencies were different between developing and/ 
or differentiating tissues and/or cells, and adult tissues and/or cells, by using the information of gene expression fre- 
quency based on the database of the nucleotide sequences of 770,546 clones shown above. 
[0291] The result of comparative analysis of cDNA libraries derived from fetal brain (FCBBF, FEBRA or OCBBF) and 
adult brain (BRACE, BRALZ, BRAMY, BRAWH, BRCAN, BRCOC, BRHIP, BRSSN, BRSTN or BRTHA) showed that 

30 the genes whose expression levels were different between the two were the following clones (Tables 1 6 to 36). 

ADRGL20020290, ADRGL20021910, ADRGL20023920, ADRGL20046760, ADRGL20062330, ADRGL20079060, 
ASTRO20009140, ASTRO20020240, ASTRO20027330, ASTRO2004751 0, ASTRO20055530, AST RO2005 5570, 
ASTRO20055930, ASTRO20090680, BGG1 12001 0750, BNGH420021680, BNGH420023870, BNGH420059680, 
BNGH420074600, BNGH420086030, BRACE1 0000510, BRACE2000331 0, BRACE20007330, BRACE20009050, 

35 BRACE20014450, BRACE20017790, BRACE20018810, BRACE20025820, BRACE20038920, BRACE20050870, 
BRACE20051600, BRACE20051930, BRACE20 052430, BRACE20052530, BRACE20054080, BRACE20054480, 
BRACE20054600, BRACE20055560, BRACE20057870, BRACE20059110, BRACE20059810, BRACE20061620, 
BRACE20062580, BRACE20 063540, BRACE20065470, BRACE20066360, BRACE2006871 0, BRACE20069000, 
BRACE20069110, BRACE20069440, BRACE20079200, BRACE20079370, BRACE20097540, BRACE20098860, 

40 BRACE20099070, BR AC E20 194670, BRACE20196180, BRACE201 96960, BRACE20200770, BRACE20200970, 
BRACE20204670, BRACE20205840, BRACE20207420, BRACE202 12450, B R AC E202 15410, BRACE2021 6700, 
BRACE20216950, BRACE20219360, BRAMY1 0000980, BRAMY1 0001 730, BRAMY2000021 0, B RAM Y2000 0250, 
BRAMY20001510, BRAMY20003540, BRAMY20003880, BRAMY20005080, BRAMY200 13670, BRAMY2001 6780, 
BRAMY20020440, B RAM Y2002 1580, BRAMY20023390, BRAMY20023640, BRAMY20024790, BRAMY20027390, 

45 BRAMY20027990, BRAMY20028530, BRAMY20028620, BRAMY20035380, BRAMY20035830, BRAMY20036530, 
BRAMY20036810, BRAMY20038980, BRAMY20039290, BRAMY20040580, BRAMY20043520, BRAMY20043630, 
BRAMY20044920, BRAMY20045210, BRAMY20045420, BRAMY20047560, BRAMY20050640, BRAMY20050940, 
BRAMY20051820, BRAMY20052440, BRAMY20053910, BRAMY20055760, BRAMY20056620, BRAMY20056840, 
BRAMY20063750, BRAMY20072440, BRAMY20072870, BRAMY20073080, BRAMY2007411 0, BRAMY20074860, 

50 BRAMY20076100, BRAMY20076130, BRAMY20076530, BRAMY20083330, BRAMY20083820, BRAMY20089770, 
BRAMY20091230, BRAMY20093490, BRAMY20094890, BRAMY20095080, BRAMY20095570, BRAMY20096930, 
BRAMY201 00680, BRAMY201 02900, BRAMY201 07980, BRAMY20111780, BRAMY201 17670, BRAMY201 18410, 
BRAMY201 18490, BRAMY20120170, BRAMY201 23400, BRAMY201 24970, BRAMY20125170, BRAMY201 25360, 
BRAMY201 25550, BRAMY201 26910, BRAMY201 27310, BRAMY20 127760, BRAMY20 134050, BRAMY201 35720, 

55 BRAMY201 37360, B R AM Y20 139440, BRAMY201 39750, BRAMY20 143870, BRAMY2015251 0, BRAMY201 55500, 
BRAMY201 58550, BRAMY201 59250, BRAMY201 60020, BRAMY201 73480, BRAMY20 190550, BRAMY201 94680, 
BRAMY20204270, BRAMY20206340, BRAMY20219620, BRAMY20221 600, BRAMY2022301 0, BRAMY2022525o! 
BRAMY20225320, BRAMY20227230, BRAMY20227860, BRAMY20227960, BRAMY20231150, BRAMY20234820, 
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BRAMY20237190, 

BRAMY20245760, 

BRAMY20271140, 

BRAWH20020600, 

BRAWH20039640, 

BRAWH20055780, 

BRAWH20082550, 

BRAWH201 73790, 

BRAWH20183170, 

BRAWH201 90550, 

BRCAN20004180, 

BRHIP1 0000720, 

BRSSN20005610, 

BRSSN20093890, 

CTONG20070720, 

CTONG20080140, 

CTONG201 83430, 

FCBBF20018680, 

FCBBF20032930, 

FCBBF20038950, 

FCBBF20061310, 

FCBBF30001100, 

FCBBF30004340, 

FCBBF30019180, 

FCBBF30035570, 

FCBBF30063990, 

FCBBF30074620, 

FCBBF30081000, 

FCBBF30093170, 

FCBBF30101240, 

FCBBF301 07290, 

FCBBF301 18670, 

FCBBF301 30410, 

FCBBF301 38000, 

FCBBF301 57270, 

FCBBF301 70710, 

FCBBF30179180, 

FCBBF301 95700, 

FCBBF30220050, 

FCBBF30230610, 

FCBBF30260210, 

FCBBF30282020, 

FCBBF50000410, 

FEBRA20007820, 

FEBRA20031150, 

FEBRA20038330, 

FEBRA20040290, 

FEBRA20057010, 

FEBRA20082660, 

FEBRA20088810, 

FEBRA20093280, 

FEBRA201 08580, 

FEBRA201 50420, 

FEBRA201 75020, 

FEBRA20191720, 

HLUNG20014590, 

HLUNG20032460, 

HLUNG20081390, 



BRAMY20238630, BRAMY20243120, BRAMY20244490, BRAMY20245140 
BRAMY20251210, B RAM Y2025 1750, BRAMY20263000, BRAMY20267780 
BRAMY20274510, BRAMY20285650, BRAMY20287400, BRAWH20014590, 
BRAWH20021910, BRAWH20025490, BRAWH20026010, BRAWH20027250 
BRAWH20040680, BRAWH20047790, BRAWH20050740, BRAWH20055240 
BRAWH20058120, BRAWH20063010, BRAWH20078080, BRAWH20078620 
BRAWH20082920, BRAWH20093040, BRAWH20093070, BRAWH20094900 
BRAWH201 74330, BRAWH201 75230, BRAWH201 75340, BRAWH201 76850 
BRAWH201 85260, BRAWH201 85270, BRAWH201 86010, BRAWH201 88750 
BRAWH20191980, BRCAN1 0000760, BRCAN 10001 050, BRCAN1 0001 680 
BRCAN20005230, BRCAN20005410, BRCOC1 0000400, BRCOC20000470 
BRHIP10001040, BRHIP20000210, BRHIP20003590, BRHIP20005060, 
BRSSN20005660, BRSSN20066440, BRSSN20074640, BRSSN200911 90 
CTONG20032930, CTONG20035240, CTONG20044870, CTONG20063930 
CTONG20071040, CTONG20071 680, CTONG20074170, CTONG20078340 
CTONG20085210, CTONG20 133720, CTONG201 65750, CTONG20 168240, 
CTONG201 86370, CTONG201 88080, FCBBF1 0000230, FCBBF1 0002200 
FCBBF20020440, FCBBF2002111 0, FCBBF20023490, FCBBF20028980 
FCBBF20033360, FCBBF20035430, FCBBF20035490, FCBBF20036360 
FCBBF20041380, FCBBF20043730, FCBBF20054390, FCBBF20056580 
FCBBF20066340, FCBBF20070800, FCBBF20070950, FCBBF30000010 
FCBBF30001150, FCBBF30002270, FCBBF30002280, FCBBF30002330 
FCBBF30004730, FCBBF30005180, FCBBF30005360, FCBBF30005500 
FCBBF30019240, FCBBF30021900, FCBBF30022680, FCBBF30026580 
FCBBF30042610, FCBBF30048420, FCBBF30053300, FCBBF30056980 
FCBBF30068210, FCBBF30071500, FCBBF30072440, FCBBF30072480 
FCBBF30075970, FCBBF3007631 0, FCBBF30078600, FCBBF30079770 
FCBBF30085560, FCBBF30088700, FCBBF30089380, FCBBF30091 010 
FCBBF30095410, FCBBF30099490, FCBBF301 00080, FCBBF30100120 
FCBBF30101300, FCBBF301 05080, FCBBF30 105440, FCBBF301 05860 
FCBBF301 07330, FCBBF30114180, FCBBF301 14850, FCBBF301 15230 
FCBBF301 18890, FCBBF301 25460, FCBBF301 25880, FCBBF301 28420 
FCBBF301 30580, FCBBF301 32050, FCBBF301 32660, FCBBF301 35890 
FCBBF301 42290, FCBBF301 43550, FCBBF301 45670, FCBBF30151190 ! 
FCBBF30161780, FCBBF301 64510, FCBBF301 66220, FCBBF301 69280 
FCBBF301 71230, FCBBF301 72330, FCBBF301 73960, FCBBF301 75350 
FCBBF301 79740, FCBBF30181 730, FCBBF301 94370, FCBBF301 94550 
FCBBF301 97840, FCBBF301 98670, FCBBF30201 630, FCBBF30212210 
FCBBF30222910, FCBBF30223110, FCBBF3022321 0, FCBBF30225930 
FCBBF30236670, FCBBF30250980, FCBBF30255680, FCBBF30257370 
FCBBF30260480, FCBBF30263080, FCBBF30266510, FCBBF30271 990 
FCBBF30285930, FCBBF30287940, FCBBF40000610, FCBBF40001920 
FCBBF50000610, FCBBF50001650, FCBBF50003530, FCBBF50004950 
FEBRA20018670, FEBRA20026820, FEBRA20027070, FEBRA20029620, 
FEBRA20031280, FEBRA20031810, FEBRA20035200, FEBRA20035240, 
FEBRA20038970, FEBRA20039070, FEBRA20039260, FEBRA20040230 
FEBRA20040560, FEBRA20045380, FEBRA20046200, FEBRA20046280 
FEBRA20063720, FEBRA20076200, FEBRA20078180, FEBRA20078800 
FEBRA2008341 0, FEBRA20084750, FEBRA20086600, FEBRA2 0087550, 
FEBRA20090160, FEBRA20090220, FEBRA20091620, FEBRA20092760 
FEBRA20095410, FEBRA20098040, FEBRA20099860, FEBRA20101410 
FEBRA201 15930, FEBRA201 16650, FEBRA20121200, FEBRA20121950 
FEBRA20151750, FEBRA201 63980, FEBRA201 70240, FEBRA201 72230 
FEBRA201 75330, FEBRA201 77800, FEBRA201 80510, FEBRA20182030 ; 
HCHON20002650, HCHON20002710, HEART1 0001 490, HLUNG20008460, 
HLUNG20015070, HLUNG20015180, HLUNG20020850, HLUNG20028110, 
HLUNG20033060, HLUNG20041590, HLUNG20045340, HLUNG20056560 
HLUNG20083480, HLUNG20085210, HLUNG200941 30, KIDNE20080690 



BRAMY20245350, 
BRAMY20269040, 
BRAWH20020470, 
BRAWH20030000, 
BRAWH20055330, 
BRAWH20080580, 
BRAWH20095900, 
BRAWH201 82670, 
BRAWH201 90530, 
BRCAN20001480, 
BRCOC20003600, 
BRSSN20001970, 
BRSSN20092440, 
CTONG20069320, 
CTONG20079590, 
CTONG201 70940, 
FCBBF1 0004760, 
FCBBF20029280, 
FCBBF20038230, 
FCBBF20059660, 
FCBBF30001020, 
FCBBF30003610, 
FCBBF30019140, 
FCBBF30029250, 
FCBBF30062490, 
FCBBF30074530, 
FCBBF30080730, 
FCBBF30091520, 
FCBBF301 00410, 
FCBBF301 06950, 
FCBBF301 15920, 
FCBBF301 29010, 
FCBBF301 36230, 
FCBBF30153170, 
FCBBF301 69870, 
FCBBF301 77290, 
FCBBF301 95690, 
FCBBF30215240, 
FCBBF30228940, 
FCBBF30259050, 
FCBBF30275590, 
FCBBF40005000, 
FEBRA20005040, 
FEBRA20031000, 
FEBRA20038220, 
FEBRA20040260, 
FEBRA20046510, 
FEBRA20080860, 
FEBRA20088610, 
FEBRA20093270, 
FEBRA20 108020, 
FEBRA20141980, 
FEBRA201 73330, 
FEBRA201 87460, 
HLUNG20011460, 
HLUNG20031620, 
HLUNG20068120, 
, KIDNE20084030, 
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KIDNE20086660, KIDNE20094670, KIDNE20134130, KIDNE201 38450, KIDNE201 40870, KIDNE201 49780, 
KIDNE201 70400, KIDNE201 73430, MESAN20021860, MESAN20030350, MESAN20034440, MESAN20038520, 
MESAN20046750, MESAN20067430, MESAN20089260, MESAN20095800, NT2NE20026200, NT2NE20033150, 
NT2NE20042550, NT2NE20045190, NT2NE20053950, NT2NE20061030, NT2NE20069580, NT2NE20082130, 

5 NT2NE20082600, NT2NE20088030, NT2NE20092950, NT2NE20095230, NT2NE20 108420, NT2NE201111 90, 
NT2NE201 12210, NT2NE20141040, NT2NE201 77210, NT2NE20181800, NT2RI20021200, NT2RP70001120, 
NT2RP70001730, NT2RP700 12830, NT2RP70035110, NT2RP70057500, NT2RP70075300, NT2RP70087140, 
NT2RP70090870, NTONG20002230, NTONG2001 7620, NTONG200491 80, NTONG20055200, OCBBF20000740, 
OCBBF20001780, OCBBF20005220, OCBBF20009820, OCBBF20011 860, OCBBF20012520, OCBBF2001 6390, 

10 OCBBF20016810, OCBBF201 09450, OCBBF201 09780, OCBBF201 10210, OCBBF201 10730, OCBBF20111370, 
OCBBF20111600, OCBBF201 12280, OCBBF201 12320, OCBBF20113110, OCBBF20115360, OCBBF201 16250, 
OCBBF201 17220, OCBBF201 18720, OCBBF201 19810, OCBBF201 20010, OCBBF201 20950, OCBBF20121910, 
OCBBF201 23200, OCBBF20 142290, OCBBF201 47070, OCBBF201 52330, OCBBF201 55030, OCBBF201 56450, 
OCBBF201 57970, OCBBF201 60380, OCBBF201 65900, OCBBF201 65910, OCBBF201 66890, OCBBF201 66900, 

15 OCBBF201 67290, OCBBF201 70350, OCBBF20 174580, OCBBF201 74890, OCBBF201 75360, OCBBF201 76650, 
OCBBF201 77540, OCBBF201 77910, OCBBF201 82060, OCBBF201 85630, OCBBF201 88280, OCBBF20191950, 
PLACE60054820, PLACE60056910, PLACE60061370, PLACE60064740, PLACE60073090, PLACE601 20280, 
PLACE601 32200, PLACE601 50510, PLACE60 154450, PLACE6015731 0, PLACE60162100, PROST1 00021 50, 
PROST20014150, PROST2001 6760, PROST20024250, PROST200351 70, PROST20035830, PROST20042700, 

20 PROST20045700, PROST20050390, PROST20054660, PROST2007871 0, PROST20094000, PROST20097310, 
PROST20097840, PROST201 03820, PROST20114100, PROST20 130320, PROST20151370, PUAEN1 0000650, 
PUAEN1 0001 640, PUAEN20003120, SKNMC20006350, SKNSH1 0001 010, SKNSH20007160, SKNSH20030640, 
SKNSH20094350, SMINT20000070, SMINT20002320, SMINT20030740, SMINT20039050, SMINT20045890, 
SMINT20047290, SMINT20048720, SMINT20056240, SMINT20077920, SMINT20088690, SMINT20089210, 

25 SMINT20089600, SMINT20094150, SPLEN20005160, SPLEN20005370, SPLEN20012450, SPLEN20024930, 
SPLEN20040780, SPLEN20048800, SPLEN20055600, SPLEN20057830, SPLEN20063250, SPLEN20071820, 
SPLEN20073880, SPLEN20076470, SPLEN201 04690, SPLEN20114190, SPLEN201 25230, SPLEN201 35030, 
SPLEN201 36700, SPLEN201 75920, SPLEN201 81570, SPLEN201 83020, SPLEN201 87490, SPLEN201 93490, 
SPLEN201 93790, SPLEN201 97740, SPLEN20200070, SPLEN20200340, TESOP1 0000350, TESTI20005980, 

30 TESTI20030440, TESTI20030610, TESTI20031410, TESTI20035330, TESTI20047370, TESTI20050400, 
TESTI20050720, TESTI20053780, TESTI20057430, TESTI20057590, TESTI20057840, TESTI20057880, 
TESTI20059080, TESTI20061200, TESTI20062580, TESTI20063410, TESTI20064530, TESTI20066280, 
TESTI20067480, TESTI20071 630, TESTI20079980, TESTI20081 890, TESTI20089290, TESTI200901 80, 
TESTI201 05130, TESTI201 061 70, TESTI201 21 040, TESTI201 50920, TESTI201 69500, TESTI201 93080, 

35 TESTI20215310, TESTI 2022 1790, TESTI 20245860, TESTI20252690, TESTI20254090, TESTI20261160, 
TESTT20262150, TESTI20274960, THYMU20007750, THYMU20009460, THYMU20009710, THYMU20019260, 
THYMU20028410, THYMU20030460, THYMU20031330, THYMU20043440, THYMU20044100, THYMU20044520, 
THYMU20049060, THYMU20055460, THYMU20055740, THYMU20071120, THYMU20078020, THYMU20089900, 
THYMU20091040, THYMU201 04480, THYMU201 20240, THYMU20139160, THYMU20 143230, THYMU20150190, 

40 THYMU201 57620, THYMU201 76010, TKIDN1 0001 920, TRACH20012490, TRACH20021000, TRACH20026640, 
TRACH20058000, TRACH20090060, TRACH201 59390, UMVEN1 0001 380, UTERU1 0001 060, UTERU20000230, 
UTERU20000950, UTERU20026620, UTERU20041970, UTERU20065470, UTERU20079240, UTERU20083020, 
UTERU20089300, UTERU20089390, UTERU20095100, UTERU201 02260, UTERU201 03200, UTERU20127150, 
UTERU201 28560 

45 [0292] The result of comparative analysis of cDNA libraries derived from fetal heart (FEHRT) and adult heart (HEART) 
showed that the genes whose expression levels were different between the two were the following clones (Table 37). 
BRAMY20043630, BRAMY20072870, BRAMY20227860, BRAWH20093070, BRCAN1 0001 680, FCBBF30053300, 
FEBRA20078800, FEBRA20090220, HCHON20000870, HEART1 0001 420, HEART1 0001 490, HEART20009590, 
HEART20019310, HEART20022200, HEART20031 680, HEART20047640, HEART200631 00, HEART20082570, 

50 HLUNG20083960, PLACE60088240, PLACE601 20280, PROST20016760, PROST200351 70, PROST20062820, 
PROST201 27450, SKMUS20006790, SKMUS20008730, TESTI20270130 

[0293] The result of comparative analysis of cDNA libraries derived from fetal kidney (FEKID) and adult kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were the following clones 
(Table 38). 

55 ASTRO20009140, BGGI1 2001 0750, BRACE20054480, BRACE20062580, BRACE2021 9360, BRAMY20001510, 
BRAMY20003540, BRAMY20003880, BRAMY20043630, BRAMY20204270, CTONG20033750, CTONG20039370, 
CTONG20045500, FCBBF20023490, FEBRA20039260, FEBRA20040290, HEART1 0001 490, HLUNG20041590, 
HLUNG20068120, HLUNG20072450, HLUNG20083960, KIDNE20011 600, KIDNE20016360, KIDNE20024380, 
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KIDNE20027980, 
KIDNE20084040, 
KIDNE20087880, 



KIDNE20080690, 
KIDNE20084730, 
KIDNE20088240, 



KIDNE20095530, KIDNE20 133460, 

KIDNE20138450, KIDNE20 140870, 

KIDNE20143200, KIDNE20147170, 

KIDNE20154330, KIDNE20 154830, 



KIDNE20081170, KIDNE20083150, KIDNE20083620, 
KIDNE20084800, KIDNE20086490, KIDNE20086660, 
KIDNE20089870, KIDNE20091090, 
KIDNE20133880, KIDNE20134130, 
KIDNE20141120, KIDNE20141700, 
KIDNE20149780, 



KIDNE20084030, 
KIDNE20086970, 
KIDNE20094260, KIDNE20094670, 



KIDNE20163710, 
KIDNE201 76030, 



KIDNE201 65390, 
KIDNE20181670, 



KIDNE20148080, 

KIDNE20155980, KIDNE20157100, 

KIDNE20169180, KIDNE201 70400, 

KIDNE201 82540, KIDNE20186170, 



KIDNE20134890, 
KIDNE201 42680, 
KIDNE20150730, 
KIDNE20160360, 
KIDNE20173150, 
KIDNE20188630, 



KIDNE20137310, 
KIDNE20142900, 
KIDNE201 52440, 
KIDNE201 60960, 
KIDNE201 73430, 
KIDNE201 89890, 



KIDNE201 89960, KIDNE20191870, OCBBF20 174890, PLACE60073090, PLACE60181870, PROST2001 6760, 
PUAEN1 0000650, SKNMC20006350, SPLEN20017610, SPLEN20063250, SPLEN20126110, SPLEN20135030, 
TESTI20061200, TESTI20262150, THYMU 10004280, THYMU20139160, TRACH20011 010 
[0294] The result of comparative analysis of cDNA libraries derived from fetal lung (FELNG) and adult lung (HLUNG) 
showed that the genes whose expression levels were different between the two were the following clones (Table 39). 
BRAMY20001510, BRAMY20043630, BRAMY20204270, BRAMY20227860, CTONG20029030, CTONG20 168460, 
CTONG201 86290, FEBRA20039260, FEBRA20078800, FEBRA20 163980, HCHON20000870, HLUNG20008460, 
HLUNG20009260, HLUNG20009550, HLUNG20010130, HLUNG20011 260, HLUNG20011440, HLUNG20011460, 
HLUNG20012140, HLUNG20014590, HLUNG20015070, HLUNG200151 80, HLUNG20020500, HLUNG20020850, 
HLUNG20021450, HLUNG20023030, HLUNG20024050, HLUNG20025620, HLUNG20028110, HLUNG20029420, 
HLUNG20029490, HLUNG20030420, HLUNG20030490, HLUNG20030610, HLUNG20031620, HLUNG20032460, 
HLUNG20033060, HLUNG20033310, HLUNG20033350, HLUNG20034970, HLUNG20037140, HLUNG20037160, 
HLUNG20037780, HLUNG20038330, HLUNG20041540, HLUNG20041590, HLUNG20042730, HLUNG20045340, 
HLUNG20047070, HLUNG20050760, HLUNG20051330, HLUNG20052300, HLUNG20054790, HLUNG20055240, 
HLUNG20056560, HLUNG20057380, HLUNG20059240, HLUNG20060670, HLUNG20063700, HLUNG20065700, 
HLUNG20065990, HLUNG2006781 0, HLUNG20068120, HLUNG20069350, HLUNG2007041 0, HLUNG20072100, 
HLUNG20072190, HLUNG20072450, HLUNG20074330, HLUNG20079310, HLUNG20081390, HLUNG20081530, 
HLUNG20082350, HLUNG20083330, HLUNG20083480, HLUNG20083840, HLUNG20083960, HLUNG20084790, 
HLUNG20085210, HLUNG20088750, HLUNG20092530, HLUNG20093030, HLUNG20094130, KIDNE20142900, 
PROST20016760, PROST20052850, SKNMC20006350, SMINT20035050, SPLEN20012450, TESTI20057590, 
TESTI20061200, TESTI20067480, TEST1201 16050, THYMU 1 0004280, THYMU20010180, THYMU20139160, 
TRACH20011010, UTERU200 16580, UTERU201 27030 
[0295] These genes are involved in regeneration of tissues and/or cells. 



EXAMPLE 8 



Expression frequency analysis by PCR 

[0296] Specific PCR primers were prepared based on the full-length nucleotide sequences, and the expression fre- 
quency was analyzed by the ATAC-PCR method (Adaptor-tagged competitive PCR method: Nucleic Acids Research 
1997, 25(22): 4694-4696; "DNA Micro-array and Advanced PCR Techniques", Cell Technology, suppiement, Eds., 
Muramatsu and Nawa (Shujunsha, 2000): 104-112). Inflammation-related genes can be identified by revealing the 
genes whose expression levels are altered depending on the presence of an inflammation-inducing factor. Then, by 
using THP-1 cell line, which is a cell line of monocyte line, andTNF-a, which is an inflammation-inducing factor, suitable 
for this system, the genes whose expression levels are altered depending on the presence of the factor were searched 
for by the system. 

[0297] THP-1 cell line (purchased from DAINiPPON PHARMACEUTICAL) wascultured. to beconfluent in RPMI1640 
medium (sigma) containing 5% fetal calf serum (GIBCO BRL). Then, the medium was changed with the medium con- 
taining 1 0 ng/ml TNF-oc (human recombinant TNF-a; Pharmacia Biotech), and the culture was continued at 37°C under 
5% C0 2 . Afterthree hours, the cells were harvested, and total RN A was extracted from them by using ISOGEN reagent 
(Nippon Gene). The extraction was carried out according to the method in the document attached to ISOGEN reagent. 
In addition, total RNA was also extracted from the cells cultured without stimulation of TNF-a. 
[0298] The genes involved in the onset of gastritis and gastroduodenal ulcer induced by the infection of Helicobacter 
pyloriXo the epithelia of stomach can be identified by revealing the genes whose expression levels are altered depending 
on co-culturing the cells with Helicobacter pylori A recent study has suggested that various substances derived from 
Helicobacter pylori trigger the inflammation reaction. In particular, the members belonging to the family of genes called 
"cag pathogenicity island (cag PAI)" contribute to the activation of the NF-kB pathway (Gastroenterology 2000, 119: 
97-108). Further, it has been found that cag PAI is involved in the onset of gastritis and the like by the study using an 
animal model (Journal of Experimental Medicine 2000, 192:1601-1610). Then, by using co-cutture of a gastric cancer 
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cell line with cag PAI-positive Helicobacter pylori (TN2), suitable for this system, the genes whose expression levels 
are altered depending on the presence of Helicobacter pylori were searched for by the system. Further, in order to 
study the involvement of cag PAI in the alterations of gene expression levels depending on the co-culture with Helico- 
bacter pylori, the altered expression levels were compared between the cells co-cultured with a strain of Helicobacter 
5 pylori (TN 2 Acag E strain) having a mutation in cagE, which is one of the cag PAI genes, and the cag PAI-positive strain 
(TN2). 

[0299] A gastric cancer cell line MKN45 (provided by the Cell Bank, RIKEN GENE BANK, The institute of Physical 
and Chemical Research) was cultured to be confluent in RPMI1 640 medium (sigma) containing 10% fetal calf serum 
(GIBCO BRL). Then, the medium was changed with the medium containing 100-fold excess (in terms of the number 

10 of cells or the number of colonies) of Helicobacter pylori (cag PAI positive strain (TN2) and cagE mutant (TN2AcagE): 
both were provided by Prof. Omata, Faculty of Medicine, The University of Tokyo), as compared with the number of 
the cancer cells. The culture was continued at 37°C under 5% C0 2 . After three hours, the cells were harvested, and 
total RNA was extracted from them by using ISOGEN reagent (Nippon Gene). The extraction was carried out according 
to the method in the document attached to ISOGEN reagent. In addition, total RNA was also extracted from the cells 

is cultu red without Helicobacter pylori. 

[0300] The analysis by the ATAC-PCR method was carried out basically according to "DNA Micro-array and Ad- 
vanced PCR Techniques", Cell Technology, supplement (Genome Science Series 1 , Eds., Muramatsu and Nawa (Shu- 
junsha, 2000): 1 04-1 1 2). Adapter ligation to the internal standard sample (sample to make the calibration curve for the 
clone of interest) and test sample was carried out in the two separate reaction systems indicated below. The combination 

20 of 6 types of adapters (AD-1 , AD-2, AD-3, AD-4, AD-5 and AD-6: see the sequences indicated below) and the samples 
are as follows. 

Reaction system A 

25 [0301] 

AD1; internal standard, 10-fold 

AD2; THP-1 cells, unstimulated 

AD3; internal standard, 3-fold 
30 AD4; THP-1 cells, TNF-a stimulation for one hour 

AD5; THP-1 cells, TNF-a stimulation for three hours 

AD6; internal standard, 1-fold 

Reaction system B 

AD1; internal standard, 1-fold 
35 AD2; MKN45 cells, unstimulated 

AD3; internal standard, 3-fold 

AD4; MKN45 cells, co-cultured with TN2 (Helicobacter pylon) 
AD5; internal standard, 10-fold 

AD6; MKN45 cells, co-cultured with TN2AcagE (cagE gene mutant) Adapter sequences: 

40 

AD1 ; 

SEQ ID NO: 394 1//5 • -GTACATATTGTCGTTAGAACGCG-3 4 
45 SEQ ID NO: 3942//3 ' -CATGTATAACAGCAATCTTGCGCCTAG-5 ' 

AD2; 

SEQ ID NO: 3943/ /5 ' -GTACATATTGTCGTTAGAACGCGACT-3 9 

50 

SEQ ID NO: 3944 //3 • -CATGTATAACAGCAATCTTGCGCTGACTAG-5 • 



AD 3 ; . 

SEQ ID NO: 3945//S • -GTACATATTGTCGTTAGAACGCGCATACT-3 1 
SEQ ID NO: 3946//3 ' -CATGTATAACAGCAATCTTGCGCGTATGACTAG-5 1 
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AD4; 

SEQ ID NO: 3947//5 • -GTACATATTGTCGTTAGAACGCGATCCATACT-3 ' 

SEQ ID NO: 3948//3 ' -CATGTATAACAGCAATCTTGCGCTAGGTATGACTAG-5.' 

ADS; 

SEQ ID NO: 3949//5 1 -GTACATATTGTCGTTAGAACGCGTCAATCCATACT-3 ' 

SEQ ID NO: 3950//3 ' -CATGTATAACAGCAATCTTGCGCAGTTAGGTATGACTAG-5 ' 

AD6; 

SEQ ID NO: 3951//5 1 -GTACATATTGTCGTTAGAACGCGTACTCAATCCATACT-3 • 



SEQ ID NO: 3952//3 ' -CATGTATAACAGCAATCTTGCGCATGAGTTAGGTATGACTAG- 
5' 

[0302] The internal standard sample used for this assay was a mixture of total RNAs from tissues (or culture cells; 
all from UNITECH) of brain, kidney, NT2, testis, thymus, and trachea. RNA was prepared according to the standard 
method. 

[0303] The sequences of primers specific to the genes and the names of clones of interest in the analysis are as 
follows. The gene specific primers were designed to produce the PCR products of 70 to 200 bp, which are derived 
from the adapter-containing cDNA. The sequence of adapter-specific primer (labeled with fluorescence (FAM)) used 
in the competitive PCR was GTAC ATATTGTCGTTAG AACG C (22 nucleotides; SEQ ID NO: 3953). PCR was basically 
carried out with a cycling profile of preheating at 94°C for 3 minutes, and 35 or 40 cycles of denaturation at 94°C for 
30 seconds/annealing at 50°C for 60 seconds/extension at 72°C for 90 seconds. 
The nucleotide sequences of clone specific primers used in the experiments 

[0304] Clone name, primer sequence and SEQ ID NO are indicated below in this order. Each is demarcated by a 
double slash mark (II). 



ADRGL2 0036380/ / CTACTCAAGGACAGCCACAC/ / SEQ 


ID 


NO: 


3954 


ASTROS 0 045840/ / GGATGTAGTGGGAAACAATG / / SEQ 


ID 


NO: 


3955 


ASTRO20 055930/ /TGCTTTTCATTCTCCTTAGT// SEQ 


ID 


NO: 


3956 


ASTR02 0088950// TACGTGCTCATTTACTTGGT / / SEQ 


ID 


NO: 


3957 


BNGH420052350//GCCAGTTTCTTTATGATTGA//SEQ 


ID 


NO: 


3958 


BRACE20052530//AATGACTTCGTTAGGATGCC//SEQ 


ID 


NO: 


3959 


B RACE20 054080/ / GCTGTTGACTTCATTTGGAA/ / SEQ 


ID 


NO: 


3960 


BRAMY20003880//TTGGTACTTATTCTGAGGCA//SEQ 


ID 


NO: 


3961 


BRAMY20 027390/ / GATTTTAGTGAAACATGCCA/ /SEQ 


ID 


NO: 


3962 


BRAMY2 0 0 2 8 5 3 0 / / TT AAAACTG AGGACATTCTG/ / SEQ 


ID 


NO: 


3963 


BRAMY2 0035380// AG AGAAGGCAGTCTAGCTTA/ / SEQ 


ID 


NO: 


3964 


BRAMY2 0036530/ / AGGGTATGGTAACTTCTGCA/ /SEQ 


ID 


NO: 


3965 


BRAMY2 0050940/ / AAAG GAGGG AC TAG AAAACT/ / SEQ 


ID 


NO: 


3966 


BRAMY20 072440/ / ACT ATGACGAGGGAACAAGA/ / SEQ 


ID 


NO: 


3967 
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BRAMY2 0 096930/ /G AGGAG AACACAAGTATGGT/ / SEQ 


ID 


NO ; 


3968 


BRAMY2 0 1184 1 0/ / AAGGTCACTTCTAAACACAC/ / SEQ 


ID 


NO: 


3969 


BRAMY2 02 3 7190 / /GGAGTGATTCAGGAGATGTG/ / SEQ 


ID 


NO: 


3970 


BRAWH2 0 055330 / /GCAACAGAGACTTTATTGGT/ / SEQ 


ID 


NO : 


3971 


B RAWH 20078620// GAG AG AC TT AT C AC AG C CAT //SEQ 


ID- 


NO: 


3972 


BRAWH20190530//ATGGGATTCTGTGACTTCTC/ /SEQ 


ID 


NO: 


3973 


BRCAN20 0 01480// CAGCAACAGTAATGGGAATT/ /SEQ 


ID 


NO: 


3974 


BRHIP10000720//AGGTTAGGATTTCTTTAGCA/ /SEQ 


ID 


NO: 


3975 


BRHIP10001040// T ACTTGGAGACAAC AGGG AG/ /SEQ 


ID 


NO: 


3976 


BRHI P20000210/ /GTGTTTGTGGGCATAGACAT/ /SEQ 


ID 


NO: 


3977 


BRSSN20001970//AATC5TATTCAGTTCCTTTCC//SEQ 


ID 


NO: 


3978 


BRSSN20091190/ / GTGTCATCACTAGCACCAAG/ /SEQ 


ID 


NO: 


3979 


CD34C20001750/ / TGGACTTAGGGACCTGACTC / /SEQ 


ID 


NO: 


3980 


CT0NG2 0078340/ / CTCTTTACCTAGTTTGGTCA/ /SEQ 


ID 


NO: 


3981 


CTONG20 079590 / /TACTTATTTTCACAGGGGCC/ / SEQ 


ID 


NO: 


3982 


CT0NG2 0083980/ /CAGCATTTTCCTATATAGCC/ /SEQ 


ID 


NO: 


3983 


CT0NG2 0085210 / / CCAGAAG AGTAGCAAGAATT / / SEQ 


ID 


NO: 


3984 


DFNES20063460/ /CTATTTTAACCCCTGCCCTC/ /SEQ 


ID 


NO: 


3985 


DFNES20 072990/ /GGAGGTATCTATTAGGGTGA/ /SEQ 


ID 


NO: 


3986 


FCBBF2 0029280 / /GACTGAGATGAACTGGAAGA/ /SEQ 


ID 


NO: 


3987 


FCBBF2 0032930/ /TCACAATACAGTCCCCTAGT / / SEQ 


ID 


NO: 


3988 


FCBBF2 0036 360/ /ATTTGTATCACTTTGGTGCA//SEQ 


ID 


NO: 


3989 


FCBBF3 0022680/ /CTCCAGAAAATGCATGAAT C / / SEQ 


ID 


NO: 


3990 


FCBBF30 0 78 6 00 //CTTCAACAGTGCTTTTCCTT / / SEQ 


ID 


NO: 


3991 


FCBBF30 1 0 5 0 80 //CTGTGCACCCACTCTTTATT/ /SEQ 


ID 


NO: 


3992 


FCBBF3 0169870 //TCCAGTATTTTCCACTTTGA/ /SEQ 


ID 


NO: 


3993 


FCBBF3 02 2 593 0//ACTATTTTATGGTCACGGCC/ / SEQ 


ID 


NO: 


3994 


FCBBF5 000061 0 // AGTTAACGTATCTGGCAAAG/ /SEQ 


ID 


NO: 


3995 


FEBRA20 0 0 7 82 0//GTTTCTCACTGTCCTGTTTT/ /SEQ 


ID 


NO: 


3996 


FEBRA2 003128 0 // ACTATTTTATGGTCACGGCC/ /SEQ 


ID 


NO: 


3997 


FEBRA2 0031810// TGCAATCATCTCTGTATCCC/ / SEQ 


ID 


NO: 


3998 


FEBRA2 0039260 // GTCAGAACCC ACTTCACATC/ / SEQ 


ID 


NO: 


3999 


FEBRA2 0046280/ / TCTCTGTCCTGTTGTCTAAG/ / SEQ 


ID 


NO: 


4000 


FEBRA2 0084750/ / TTAGCATGTACTGGGAAAGC/ / SEQ 


ID 


NO: 


4001 


FEBRA2 0182030 / / AAAACACAAAATGACACCCC/ / SEQ 


ID 


NO: 


4002 


HLUNG2 0041540/ / AAAGTTCCTCTGCATTCACC/ / SEQ 


ID 


NO: 


4003 



126 



EP 1 308 459 A2 



HLUNG2 0092530/ / TTTTCATCCCAGAGTT ATTA/ / SEQ 


ID 


NO: 


4004 


KIDNE20 084030/ / AGGGAATAACTTGCAGCTTG/ / SEQ 


ID 


NO: 


4005 


KIDNE20 0 84800 / /GTAATGTAGGGAGACTGCCG/ /SEQ 


ID 


NO: 


4006 


KIDNE20 134130/ / AATCCCCTCTTTTGTCTCAT/ /SEQ 


ID 


NO: 


4007 


KIDNE2 0182540/ / ACAGATAGCCTGGATTGAAA/ / SEQ 


ID-. 


NO: 


4008 


KIDNE 20186170/ / TTGTATCTGAGCTGGGGTTT/ / SEQ 


ID 


NO: 


4009 


KIDNE20 188630/ / CCCTACATATCTCTACCCAT/ / SEQ 


ID 


NO: 


4010 


L IVER2 0007750// TATTTAGAAACGCAGACCCC/ / SEQ 


ID 


NO: 


4011 


MES AN 2 0 021220// TAGAAGTCAACAAAAGGCAC/ / SEO 


ID 


NO' 


4012 

T V X C 


MESAN2G084150/ /TCCATAAGGCACAGATTTGAy y SEO 


ID 


NO- 


*a w X «J 


NT2NE2 00 59210 / /ATAATGACAATGCCAGTAGT//SEO 


ID 


NO • 


4014 


NT7NE90nR 9 1^0/ /TGAGRT APATrfA AATTAAA J / 3PO 


X Lf 


IN v • 


T V X J 


NT7NE20097950 / /ATGATTArTrGGTTTCrAGA/ /SCO 


x u 


MO • 






T n 

X V 


Kin - 


fx W JL / 


OCBBF2001 9 59 A / /TrTGrPTnTAnTTGnrATTA/ /^PO 


X Lf 


MO • 


401 ft 
*± v x> 0 




tn 
lu 


NU . 


a m q 
ft uiy 


nrRRp?o i i 0.7 ^ a / yTTAf5AT(2f , T , fTPTAAftG r prn y /qph 


Tn 

X L/ 




4 fi 9 A 


oprrp?a i s sn ^ a / yGPTAAAATpfSTf^PATPTnTAy /^fo 


th 
li/ 


MO • 

INU . 


ft U Z 1 


UtotJP /iUiDjyUl// / A(jI a X IbiAltlLUl xbx UA/ / b&U 


t r\ 
ID 


NO : 


4 022 


vjuddt ^Ul/U^OU / / x AAV3Ax\jv?A<jx lUAljVjbvjAVj/ / oLy 


t n 
IU 


NU : 


A t\ O *3 




t n 


NU: 


y| A o >i 
4U24 


PLAPPfinnofiiftft / yTTPTPTA ata apppptptpa / / crn 


Tn 


IMU . 


fiu^O 


PT APFf\ftft£1 77A / /TPT"rr , Ar*AAA , PAi':r , aTAA aa / /cita 


t n 
xU 


VTA . 

nu : 


/t A o c 
4U2d 


fKUOlZUUx 11DU/ / L I AL 1 AAL 1 L AALt ALbLA 1 / / b£*U 


t n 


VIA . 

nu : 


/i o t 


PRO^T9AA4l 4fiA / /rc attt apctp apptptptp / /cpn 

JT I\VJP iiUUIilOU / / LwA 1 X X AWVjX LnLU initio/ / O&w 


Tn 


MA » 
NU . 


/i n 9 d 
ft o 


rKUoixUUO D1UU / /ALIA1 1 x I Ai uCjl CAlbbLt/ / bay 


IU 


NO: 


>l A O ft 

4029 


rtA7'\9Pft/ / A c t r ,r v r rfi a r"vr ,f T>r' a'PArrr'Tr* / /cp^ 
f r\UO iZU U / jZoU / / ALol lbALlLlbAlAbLLlb/ / b&U 


1U 


nu : 


40J0 


pro^t9 ni ft 6 n n / / a A r rTp f r r P r PTcar , ii r PTCPT'm / / ce*a 

rrvVJOl -cU 1UDL/DU / / nnl J.U1 11 IbnLAi Ibtl lvj/ / b£*U 


Tn 
1U 


MA • 

nu : 


A A *3 T 

4U Jl 


PPn £ ?T9D union / /nziTa a attt* a rr*!! ananpiT / / crr» 
zrlMJDX tullulZU / / bAlAAAl ± UAoUAAvxAvjUAI / / b£/U 


Tn 
1 u 


MA • 

nu : 




9TCMn^90AQ l Qftft / yA aptptppaptpp&ta aptp / /cs*n 


Tn 

IU 


VTA • 

nu: 


4 U J J 


SMI NT 20024140/ / AAGCCTCTAAAAG TCAACAC/ /SEQ 


ID 


NO: 


4034 


SMINT20 092160/ / TTAAAC AAGTGAGCCTC AGA/ / SEQ 


ID 


NO: 


4035 


SPLEN2004 07 8 0 //TTTCCTGTTTGGTTAGTTTT//SEQ 


ID 


NO: 


4036 


S P LEN2 0110860/ / CTGACGGAAAACTTCTAATT/ / SEQ 


ID 


NO: 


4037 


SPLEN20 177400/ / ATATCTGGTTGTTGGGTTTT/ / SEQ 


ID 


NO: 


4038 


TESTI20038240//G TCTGTCTTGATGGATTGGA/ /SEQ 


ID 


NO: 


4039 
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TESTI20043130/ /AACTATCAGACTGCAAGAGC//SEQ 


ID 


NO: 


4040 


TESTI20046540//GGTAGCCAATAGCAAACAGG//SEQ 


ID 


NO: 


4041 


TESTI20047370//ACGTTGCATAATCCTCAGTC//SEQ 


ID 


NO: 


4042 


TESTI20057200//AGTCCCAGTCTCTAGTTCGG//SEQ 


ID 


NO: 


4043 


TE ST 120057590/ / ACATTTTGGTATTG AC ACTT/ / SEQ 


ID- 


NO: 


4044 


TEST 120113940/ / GTCAGTCCACCTTACTCTTT/ / SEQ 


ID 


NO: 


4045 


TEST 120149880/ /CAAACGATTACG ACACAAAA/ /SEQ 


ID 


NO: 


4046 


TESTI20151800//CGTTCCTCAGGTAGCAAGAT//SEQ 


ID 


NO: 


4047 


TESTI20173050/ / ACATGGCTGAAGGTGATTTT/ / SEQ 


ID 


NO: 


4048 


TE ST 120198600/ / TTTAG AAACATTGGC ATCAG/ / SEQ 


ID 


NO: 


4049 


TEST 1 2 0 2 5 7 9 1 0 / /CTGCCTAGAGT AGAACAAAA/ / SEQ 


ID 


NO: 


4050 


TESTI2026294 0//CTCCCAATCTCAAACACAAG//SEQ 


ID 


NO: 


4051 


THYMU2 0046770/ / CTTCTGCCGAGTTTGTGTAA/ /SEQ 


ID 


NO: 


4052 


THYMU20O5855O//GATGCTGAGAAGGTGTTAGT//SEQ 


ID 


NO: 


4053 


THYMU20062520//AGTCTCAGGATGGGTAAAGG//SEQ 


ID 


NO: 


4054 


TH YMU2 0062770 / / AGAGTTAAGAACCGAGGGAT/ / SEQ 


ID 


NO: 


4055 


THYMU2 0078240/ / CAAGCCAGGGAGATAGACAT// SEQ 


ID 


NO: 


4056 


THYMU20 150190// T ACTACAATGTGGGCTACGG/ / SEQ 


ID 


NO: 


4057 


TRACH2 0125620/ / CC ACATTGTAAAC AGTCCTT/ / SEQ 


ID 


NO: 


4058 


TRACH2 0149740/ / AGATACATTTTCCGTCAAGC/ /SEQ 


ID 


NO: 


4059 


TRACH2 0190460 / /CC AG AAGAGTAGCAAGAATT/ / SEQ 


ID 


NO: 


4060 


UTERU20045200/ /ATTCAACTAAAACAAAGCTG//SEQ 


ID 


NO: 


4061 


UTERU2 0064120 / / AC C C AG AAAAG AG ATG AGAA/ / SEQ 


ID 


NO: 


4062 


UTERU2 0103200/ / CTGTTCCTGGCAAATAAG AG/ / SEQ 


ID 


NO: 


4063 


ADRGL2 0046760/ / ATGTGAAGGAATGATGTACT/ / SEQ 


ID 


NO: 


4064 


ASTR02 0055530/ /GAATAATGAAG GGG AC CAGA/ / SEQ 


ID 


NO: 


4065 


BRAMY2 0076130/ /CCTTTCATGTCTCAGTATTT/ / SEQ 


ID 


NO: 


4066 


CTOm20 170940/ / ATACGTCAGAGGACACATGC/ / SEQ 


ID 


NO: 


4067 


FCBBF20 033360/ / TCCGT AGC AGT AG AAACATC/ /SEQ 


ID 


NO: 


4068 


FCBBF3 0257370 / /CAGGTATGCTTGGAGTTTCA// SEQ 


ID 


NO: 


4069 


FCBBF50 00 1650/ / CG TG AT TAGGAAGGG AC AGT/ / SEQ 


ID 


NO: 


4070 


FEBRA2 0040290/ /CCCAAG AACGAAAC AAAACT/ / SEQ 


ID 


NO: 


4071 


FEBRA2 0063720 / / AAT AATGCCCACCTATAAAA/ / SEQ 


ID 


NO: 


4072 


FEBRA2 009 BO 40/ /GAGGGGAATTGTCAGTACAC//SEQ 


ID 


NO; 


4073 


FEBRA2 0108580/ /TC ATTTTGTCT AGTGCCCAT/ / SEQ 


ID 


NO: 


4074 


ME S AN2 0 021860/ / AC ACATTCCCATCAATAGGT/ / SEQ 


ID 


NO: 


4075 
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ME SAN20 067430/ / AGCTAAGGAGGTTTTCACAT/ / SEQ 


ID 


NO: 


4076 


5 


NT2NE20 045190 / / GGAATATGTTGGGCTAGTTA/ / SEQ 


ID 


NO: 


4077 




PROST20 016760/ / AACTTCATCCATTCCAACTG/ / SEQ 


ID 


NO: 


4078 




SKNSH2 0 00 7 160// TTAAACCAACATTGAGGAAA/ / SEQ 


ID 


NO: 


4079 


10 


SMINT20006020//CTTGGTTGTCCCCTTTCTAG//SEQ 


ID- 


NO: 


4080 


TESTI20059370//GGCTGACrTTTCTCCTACAG//SEQ 


ID 


NO: 


4081 




TESTI20103690//CCACTTTATTTCTCCTCCCT//SEQ 


ID 


NO: 


4082 




TE ST 120254480// GTGGACACAACTTGCTTTAC / / SEQ 


ID 


NO: 


4083 


15 


THYMU1 00 042 8 0 / /GAGAGTCTGCCTAGCTGTGT/ / SEQ 


ID 


NO: 


4084 




THYMU20 030460/ / GCCCATGTGAGTAGGTGTAG// SEQ 


ID 


NO: 


4085 




TRACH2 0090060 / / AGGGCCAACTTAAATCTCTG/ / SEQ 


ID 


NO: 


4086 


20 


UTERU20 041970/ / GATAAACCCC AAACATGAAA/ / SEQ 


ID 


NO: 


4087 




BRAMY20 125360 / /GACAGACTAAAACGTTGAGC/ / SEQ 


ID 


NO: 


4088 




OCBB F2 0 142290/ / CCAAGGAG AGTC AGTG AC AG / / SEQ 


ID 


NO: 


4089 


25 


SKMU S 20006790/ / TTCAAAAGC AGAGACTCCCT //SEQ 


ID 


NO: 


4090 


TESTI20030610/ / TGAACTCAGTACCAGGCTTG / / SEQ 


ID 


NO: 


4091 




UTERU2 0 026620 / /GAG ATTCCCTAGTGGTGGTT/ / SEQ 


ID 


NO: 


4092 



[0305] The result of expression frequency analysis is shown in Table 40. The clones not shown in the table contain 
30 clones whose expression levels could not be measured because the levels were too low or the sizes of the PCR 
products were different from the expected. It was confirmed that the expression levels of IL-8 gene used as a positive 
control gene were elevated. 

[0306] The result obtained by the search for the genes whose expression levels were altered depending on the 
presence of TNF-a in culturing THP-1 cell, which is a human monocyte cell line, showed that the clones whose ex- 
35 pression levels were elevated by twofold or more one or three hours after the stimulation (the clones whose expression 
levels were 0.1 or lower both before and after the stimulation were excluded), were 

ASTRO20055530, ASTRO20055930, ASTRO20088950, BRAMY20027390, BRAMY200761 30, BRAMY2011841 0, 
B RAM Y20 125360, BRAMY20237190, BRCAN20001480, BRHIP1 0000720, CD34C20001750, CTONG20078340, 
CTONG20085210, DFNES20063460, FCBBF20029280, FCBBF20033360, FCBBF30078600, FEBRA20007820, 

40 FEBRA20031280, FEBRA2003181 0, FEBRA20040290, HLUNG20041540, HLUNG20092530, MESAN20021860, 
MESAN20067430, MESAN20084150, NT2NE20092950, NT2RP70031070, OCBBF20012520, OCBBF201 42290, 
OCBBF201 65900, OCBB F20 170350, OCBBF201 76650, PLACE60006300, PROST20011160, PROST201 06060, 
SPLEN20040780, SPLEN201 10860, S PL EN201 77400, TESTI2003061 0, TEST 1200431 30, TESTI20059370, 
TESTI20254480, THYMU 10004280, THYMU20030460, THYMU20062520, THYMU20078240, THYMU20150190, 

45 TRACH20090060, TRACH201 25620, UTERU20026620, UTERU20045200, UTERU20064120, UTERU201 03200. 
[0307] On the other hand, in particular cases where the expression levels were relatively high in the unstimulated 
cells (the relative value was 1 or higher), the clones whose expression levels were decreased by twofold or more by 
the TNF-a stimulation (the clones whose expression levels were increased 1 or 3 hours after the stimulation were 
excluded) were 

50 BNGH420052350, BRACE20052530, BRAMY20003880, CTONG201 70940, FCBBF30022680, FCBBF30225930, 
FCBBF30257370, FEBRA20046280, KIDNE20084030, KIDNE201 88630, NT2NE20082130, OCBBF201 10210, 
PLACE60061370, PROST20041460, PROST20075280, PROST20110120, SMINT20006020, TESTI20046540, 
TESTI20057200, TESTI201 13940, TESTI20257910, TESTI20262940, TRACH201 49740. 
[0308] These clones were thus revealed to be involved in the inflammation reaction induced by TNFkx. 

55 [0309] The result obtained by the search for the genes whose expression levels were altered depending on co- 
culturing gastric cancer cell line MKN45 with cag PAI positive Helicobacter pylori (TN2), showed that the clones whose 
expression levels were elevated by twofold or more (the clones whose expression levels were 0.1 or lower both before 
and after the stimulation were excluded), were 
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BRAMY20028530, BRAMY20035380, OCBBF201 70350, PROST20011160, SKMUS20091900, SPLEN20040780, 
THYMU20078240, TRACH201 90460, UTERU20045200, UTERU20064120, ASTR020055530, CTONG20 170940, 
FEBRA20040290, MESAN20067430, PROST20016760, THYMU1 0004280, TRACH20090060, UTERU20041970, 
OCBBF201 42290, TESTI20030610. 

[0310] Of these clones, the expression levels of BRAMY20035380, SKMUS20091900, SPLEN20040780, 
UTERU20064120, CTONG201 70940, OCBBF201 42290, TESTI2003061 0 were not increased by the co-culture with 
the cagE mutant (TN2AcagE). There may be the possibility that the expression levels of the 7 clones are altered via 
the NF-kB pathway. Among them, the expression levels of OCBBF201 42290, SPLEN20040780, TESTI20030610, 
UTERU20064120 were also increased when human monocyte cell line THP-1 was stimulated with TNFoc. 
[0311] On the other hand, in particular cases where the expression levels were relatively high in the unstimulated 
cells (the relative value was 1 or higher), the clones whose expression levels were decreased by twofold or more in 
the presence of Helicobacter pylori were 

ASTRO20088950, BRACE20052530, BRAMY20003880, BRAMY20027390, BRAMY20036530, BRAMY201 18410, 
BRHIP20000210, FCBBF20032930, FCBBF30022680, FCBBF301 69870, FEBRA201 82030, KIDNE201 82540, 
LIVER20007750, MES AN 2002 1220, NT2NE20059210, NT2NE20082130, OCBBF20 155030, PROST20065100, 
PROST20075280, SPLEN201 10860, TESTI20057200, TESTI201 13940, TESTI201 49880, TESTI20151800, 
TESTI201 98600, TESTI2025791 0, THYMU 20046770, THYMU 20058550, THYMU20150190, FCBBF20033360, 
FCBBF30257370, FEBRA20098040, SMINT20006020. 
[0312] These clones are involved in gastritis or gastroduodenal ulcer. 



Table 3 



Clone ID 


CD34C 


D30ST 


D60ST 


D90ST 


ASTRO200 10290 


0 


55.437 


0 


0 


BRAMY20036530 


0 


33.144 


0 


0 


BRAMY20043630 


0 


0 


0 


13.575 


BRAMY20089770 


0 


0 


0 


63.803 


BRAMY201 90550 


0 


0 


0 


31.089 


CD34C20001750 


100 


0 


0 


0 


FCBBF20066340 


0 


0 


0 


76.503 


FEBRA20040290 


0 


0 


0 


14.912 


HLUNG20015180 


0 


0 


0 


8.491 


HLUNG20041590 


0 


0 


0 


18.349 


HLUNG20052300 


0 


36.241 


0 


0 


KIDNE20084040 


0 


0 


0 


65.916 


MESAN20021860 


0 


0 


0 


56.046 


MESAN20027240 


0 


0 


0 


33.731 


NTONG20055200 


0 


0 


0 


19.731 


PROST200 16760 


0 


0 


0 


15.442 


PUAEN1 0001 640 


0 


34.074 


0 


0 


SMINT20006020 


0 


7.702 


0 


9.021 


SMINT20028840 


0 


68.605 


0 


0 


SMINT20035050 


0 


9.417 


0 


11.029 


SPLEN20181570 


81.506 


0 


0 


0 


TESTI20064530 


0 


0 


77.29 


0 


TESTI20210030 


0 


0 


0 


90.471 


THYMU20029830 


0 


42.091 


0 


0 


THYMU20139160 


0 


0 


0 


3.486 


TRACH20051590 


0 


0 


0 


75.217 



Table 4 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


ADRGL20023920 


0 


6.69 


0 


0 
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Table 4 (continued) 





Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 




ASTR020009140 


o 


o 


13.389 


0 




BNGH420077980 


0 


o 


20.754 


o 




BNGH420086030 


o 


o 


o 


13.322 




BRACE20062580 


0 


o 


3.585 


2.717 




BRACE20079370 


0 


o 


9.312 


14.111 


10 


BRACE202 15410 


o 


o 


27.196 


n 
u 


BRAMY20003540 


o 


o 


3.208 


4 8R1 




BRAMYP0043630 


o 


o 


2.408 






BRAMY20076130 


o 


o 


fi R01 


n 




BRAMY20095080 


o 


o 


25.606 


o 


15 


BRAMYP0PP78R0 


n 
\j 


P 7Q8 


R 17 


o.oyo 




BRAWHP008P550 






n 


Q 7^8 




BRHIP1 0001 040 

L_F 1 11 III 1 \J\J\J 1 Wtw 


o 


0 


A 
VI 


p ^P1 




RRSSNIP0005R1 0 




OO QQQ 


u 


U 




CTONGP00P766Q 


o 


o 


en R4P 


A 

u 


20 


CTONGP0044P30 


o 




Cf| R4p 


A 
U 




rrroNGPOORRi 1 n 


n 

u 




\ o. I o [ 


u 




nTONG?007QRQ0 


a 

u 


14 PQ7 


pq 1 np 


U 




C:TONGPOOR4fifiO 








n 
u 


25 






O.OO I 


u 


Q 11 1 






a 

u 




u 


n 
u 




CTONGP01 88080 


o 


10.319 


n 


A 

u 




FPRRFpnnp^Qn 


n 
u 


7 Q7R 


n 


/ .OoO 




FCRRFPnm^fin 


n 

V 


u 


O.U 1 o 


u 


30 


FPRRFPOO^QfifiO. 


u 


u 


u 






FfRRF2nn7nQ^n 


a 


u 


1 O.OO/ 


u 




FCRRF ^000434/1 


a 

\j 




n 

u 


8 778 
O.f to 




FCRRF^00Q^41 0 


a 
\j 


7Q P15 


A 
U 




35 


FCRRFIOI P^4fifl 


a 
u 




A 
U 


Q 101 




FHRRF^OI 7Q1 80 


n 

u 


u 


c>R 41 A 
00,4 I o 


A 




FPRRF^n9^fiR7n 


u 


1 qpe 


4 4Q4 


9 79,4 




FCRRF^0PS7^70 


n 


1Q RPP 




n 
u 




FCRRFR0000R1 0 


n 




Rfi 418 

OO.H 1 O 


A 

u 


40 


FCRRF50001 R50 
i vuur juyu iuju 


o 




n 


PP 181 




FEBRA20038330 




1fi 1P4 


n 
u 


A 
V 




FFRRAPOO^QPRO. 


n 


7 1RP 


P 4^ 


A 

u 




FFRRA?nnfi?7?n 


n 


IO OCR 






45 


FFRRAP00QDPP0 


n 

u 


4 ROP 


9 1PP 


4 79P 


FFRRAP01 ^fJ4Pn 


n 

V 




^1 PR 


n 




HEART1 0001 4Q0 


o 


o 


Q P7 


o 




HLUNGP00'3P4fin 


n 


n 


n 


P1 P78 




HLUNGP0041 ^Q0 


n 




n 


4 Q?P 


50 


KIANE20089870 


0 


0 


0 


4.145 




MESAN200 16270 


0 


0 


39.208 


0 




MESAN20021860 


0 


0 


0 


7.532 




MESAN20060430 


0 


0 


24.385 


0 


55 


MESAN20067430 


0 


0 


0 


9.568 


NT2NE20018740 


0 


0 


0 


100 




NT2NE20018890 


0 


0 


0 


100 




NT2NE20021860 


0 


0 


0 


100 
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Table 4 (continued) 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


NT2N E20026200 


o 


o 


0.808 


P 44 Q 


NT2NE2002651 0 


o 


o 


n 


1 on 

1 VJVJ 


NT2NE 20028700 


o 


o 


o 


pp PPT 


NT2NE 20033 150 


o 


o 


0 


50 1TT 


NT2NF20037050 


o 


o 


n 

vj 


1 on 

I VJVJ 


NTPN FP00T8870 

IN 1 ClMLcVUOOOf U 


n 
\j 


o 


VJ 


1 nn 

I VJVJ 


NTPNFP00TQP1 0 


n 

VJ 


o 


A 
V 


1 nn 

I VJVJ 


NT2NF20042550 


o 


o 


o 

VJ 


5Q OR*3 


NTPNF200451 Q0 


o 


o 

VJ 


5 05 


1 5 °,Ofi 
I o.ouo 


NTPNFP0047870 

IN 1 ClNLCUUtf O/ \J 


o 


vj 


A 
VJ 


mn 

1 VJVJ 


NTPN FP0053P30 


V 


n 
vj 


A 
VJ 


1 nn 

I VJVJ 


NTPNFPnn5TQ50 

IN 1 cINCcUUJOaJU 


n 


on QRQ 


A 
U 


P1 544 


NTPNFP005QP1 0 

IN 1 bliLbUUJ9£ 1 \J 


o 


vj 


A 
VJ 


1 nn 

I VJVJ 


NTPNFP0n5QR8n 


n 
yj 


A 
U 


A 
U 


1 nn 

l VJVJ 


NTPNFP00fi0750 




vj 


A 
VJ 


mn 

I VJVJ 


NTPNFP00fi1 0^0 

IN 1 blNt^UUU 1 UOu 


vj 


TQ 18 

OS. 1 O 


A 
VJ 


4n pp4 


NTPNFPOnfiPSRn 

IN 1 cINllUUOcOOU 


u 


u 


A 
U 


1 nn 

1 UU 


IN 1 cIMlcUUOH/ OVJ 


a 
u 


u 


A 
U 


1 nn 
1 uu 


NTPNFPnnfifi5Qn 


n 
u 


u 


u 


1 uu 


NTPNFPnnfiQ5ftn 
In 1 cIMC^UUOaOOU 


u 


n 
u 


A 

u 


CQ AQQ 

oy.uoo 


NTPNFPn07n59n 




n 
u 


A 
VJ 


a nn 
1 uu 


NT9NFPnn7'}fi5n 

IN 1 cINC^UU/OQOU 


a 
u 


n 
u 


A 
U 


i AA 

1 uu 


NTPN FPnn77P5n 

IN 1 ^INr^UU/ rcOU 


a 
u 


n 
u 


A 
U 


i AA 

1 uu 


NTPNFPn077P7n 


a 
u 


n 
u 


A 
U 


1 AA 
1 UU 


In 1 tINCtUU/ / oOU 


n 
u 


u 


A 
U 


■i AA 
1 UU 


NTPN FPn07Qfi7n 

IN 1 cIMLcUU/yO/U 


a 

VJ 


A 


A 
VJ 


1 nn 
1 uu 


ntpn Fpnnan77n 

IN 1 dIMCcUUDUf / u 


n 
u 


A 

u 


A 
U 


■i AA 

1 uu 


IN 1 cLIN xZ.c.\J\J Oc. 1 OU 


a 


A 

u 


A 
VJ 


Tfl no 


NTPWFpnnApfion 

IN 1 tIMCcUUOcQUU 


n 
u 


A 

u 


A 
VJ 


DD.cOO 


IN 1 ^lNd<£UUOOUf U 


u 


A 
U 


A 
VJ 


•i AA 
1 UU 


ntpkj Fpnnft7P7n 

IN 1 *1IN CcWO/ C.I \J 




A 

u 


A 
VJ 


1 nn 
1 uu 


MTPNJFpnnft7fl^n 

In 1 ^INC^UUOf OOU 


u 


A 

u 


A 
VJ 


1 nn 
1 uu 


IN 1 cnCcUUOOvOU 


vj 


A 
VJ 


A 
VJ 


fin 4P5 


NTPN FPnOQPQ^n 

IN 1 CliLcUV9c9JU 


n 


pn pqr 


A 
VJ 


pn A7 


NTPN F9nnQ5P°.n 


u 


A 
VJ 


A 
VJ 


EC •( Q ft 

DO. ) OO 


NTPNFpm 04nnn 

IN 1 £INC£VJ 1 UHUUU 




A 

VJ 


A 

VJ 


■i nn 
1 uu 


NTPNFP01 0781 0 

IN I £iNC_£VJ iU/O 1 U 


u 


A 
VJ 


A 
VJ 


1 nn 
I uu 


NTPNFP01 084P0 

IN 1 £INCZ£U 1 UOtcv 


vj 


A 
VJ 


A 
VJ 


45 An7 


NTPNFP01111Qn 

IN 1 cINCCV Mil J7VJ 


vj 


A 
VJ 


A 
VJ 


51 PflQ 


NTPNFP011PP10 

IN 1 *1IN L^IVJ I | ^£ | v 


vj 


A 
VJ 


A 
VJ 


Pfi fifiQ 


NTPNFP01 1 4R5n 

IN I £INC*LU 1 1 4 OOU 


n 


A 
VJ 


A 
VJ 


1 nn 
l uu 


NTPNFP01 1 7580 

IN 1 C.\HC£.\J 1 1 / wOVJ 


n 

VJ 


A 
VJ 


A 
VJ 


1 nn 

I VJVJ 


NT2NE201 19980 


0 


0 


0 


100 


NT2NE201 23610 


0 


0 


0 


100 


NT2NE201 24570 


0 


0 


0 


100 


NT2NE201 26030 


0 


0 


0 


100 


NT2NE201 27900 


0 


0 


16.514 


25.025 


NT2NE201 40130 


0 


0 


0 


100 


NT2NE201 40280 


0 


0 


0 


100 


NT2NE20141040 


0 


0 


0 


36.013 
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Table 4 (continued) 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


NT2NE201 45250 


0 


0 


o 


100 


NT2NE20 146510 


0 


0 


o 


100 


NT2NE20 148690 


0 


0 


o 


67.567 


NT2NE20 149500 


0 


0 


o 


100 


NT2NE20 150610 


o 


o 


o 


100 


NT2NE20 152620 


0 


o 


o 


100 


NT2NE20 153620 


o 


o 


o 


60 858 


NT2NE20 155650 


I 0 


o 


o 


100 


NT2NE20157120 


i 0 


o 


o 


100 


NT2NE20165190 


| o 


o 


o 


100 


NT2NE20 167660 


o 


o 


o 


1 00 


NT2NE20 173970 


o 


o 


o 


100 


NT2NE201 77210 


o 


o 


o 


43 734 


NT2NE20181760 


o 


o 


o 


100 


NT2NE20181800 


o 


o 


o 


46.215 


NT2NE20 184720 


o 


o 


o 


100 


NT2R I200 16240 

111 *— 1 11^ W 1 UL~V 


o 


o 


100 


o 


NT2RI20021200 


o 


q 


8.844 


A 
\j 


NT2R 120033920 


o 


o 


100 


A 
u 


NTPRIPOOQ^OIO 




A 

yj 


1 00 


u 


NT2RP70001120 


o 


32.573 


o 


0 


NT2RP 7000 1730 


o 


15.462 


o 


o 


NT2RP70003110 


o 


24.333 


o 


A 
\j 


NT2RP7001 28^0 




O .UOC7 


a 

\j 


R 7Qfl 

O. / oo 


NT2RP700228P0 


o 




A 
\j 


U 


NT2RP700277Q0 

11 i CI 11 / U \JC- i / £7 w 


o 


60.1 94 


fl 


u 


NT2RP70029780 


o 


100 


o 


o 


NT2RP70030840 

111 w 1 11 f Wwvl^Tw 


o 


100 


o 


fl 


NTPRP70011 070 




1 no 


A 
\j 


fl 


NT2RP70031340 


o 


100 


0 


fl 


NT2RP70031480 


o 


100 


n 
\j 


fl 


NT2RP70035110 


o 


23.442 


o 


fl 


NT2RP7004641 0 

1 1 1 CI 1 1 IV \J^\J^ 1 V 


o 


64.358 


o 


fl 


IMT2RP7004961 0 

111 bl 11 / wwTWv 1 V 


o 


100 


o 


o 


NT2RP70056290 

ii i ci ll r w«JUCvU 


o 


100 


o 


o 


NT2RP70056690 

11 1 CI 11 f VVJUUvU 


o 


100 


o 


fl 


NT2RP70057500 

111 bl 11 / wvw f **J\J\S 


o 


13.6 


o 


o 


MT2RP70064570 1 

1 1 1 CI 11 f UUUtv f V 


o 


60.194 


o 


o 


NT2RP70074800 


o 


100 


o 


o 


NT2RP70075300 


o 


6.726 


o 


0 


NT2RP70075800 

1 « 1 till 9 W 9 w \J\J\J 


o 


100 


o 


o 


NT2RP70080150 


o 


100 


o 


o 


NT2RP70084540 


0 


100 


0 


0 


NT2RP70087140 


0 


7.444 


15.153 


7.654 


NT2RP70090870 


0 


17.122 


11.617 


0 


OCBBF20001780 


0 


0 


15.351 


0 


OCBBF20009820 


84.348 


0 


0 


0 


OCBBF201 42290 


0 


0 


0 


14.577 


OCBBF201 55030 


0 


49.439 


0 


0 


OCBBF201 75360 


0 


0 


0 


20.881 
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Table 4 (continued) 



Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


OCBBF20 177540 


o 


25.509 


o 


o 


OCBBF201 77910 


o 


o 


23.505 


o 


PLACE 60054820 


0 


0 


20.089 


0 


PLACE60061370 


o 


18.07 


0 


0 


PLACE60073090 


0 


10.617 


0 


o 


PLACE60162100 


o 


o 


o 


37.629 


PROST20011800 


o 


o 


o 


50.542 


PROST20045700 


o 


o 


o 


44.59 


PROST20078710 


o 


o 


o 


33.633 


PROST20094000 


0 


0 


18.34 


o 


PUAEN 10000650 

1 f\ 1— r >■ I wwwwwwv 


o 


3375 


o 


o 


PUAEN 10001640 

1 V/» L_ 1 V I w W w 1 w"Tw 


o 


5.216 


3.539 


o 


SKNMC20006350 

1 XI H 1 VI W£> Wis Www V * 


o 


2.005 


2.041 


4.124 


SMINT20016150 


o 


49 972 


0 


n 
yj 


SMINT20030740 


o 


5.712 


yj 


n 
yj 


SMINT20035510 


o 


o 


28.984 


o 


SMINT20039050 


o 


0 


1 ft R7fi 


Q ^ft^ 


SM 1 NT20047?Q0 


n 
yj 


OS.O 




u 


£PLFN?00fi^ oc ,0 




ft 0R1 


ft 


ft *V)Q 


SPLEN20117580 


o 


o 


yj 


OA AfiO 


SPLFN201 25230 


o 




21 




TESTI2003Q610 


o 




o 


1? °05 


TFSTI?004 Q .Q1 0 


yj 






AC Geo 


TFSTIPOOfifiPRO 


n 
\j 






ou.o / □ 


TFSTI20067480 




ft RR1 




u 


tf^tipoiosi ^o 


u 


u 


u u 




TFSTI20106170 


o 


1°. 1 04 

1 O. 1 U4 


yj 


n 
u 


TESTI20143180 


o 


o 


62.741 


n 


TFSTI202217Q0 


o 


n 


1R7 


n 

u 


TFSTIPOP^OQO 


n 
\j 




u 


1 Q OR^ 


TFSTIP0P74Q6O 






QD ROC 


u 


THYMU1 0004280 


yj 


^ ^7^ 




n 

u 


THYMU20007020 


yj 


n 


71 017 


u 


THYMU201 04480 


o 


o 


29.694 


n 


THYMU20139160 


0 


1.822 


0.618 


0 


TRACH20026640 


0 


0 


7.476 


0 


UTERU1 0001 060 


0 


0 


19.967 


0 


UTERU20026620 


0 


10.41 


0 


10.703 


UTERU20079240 


0 


0 


0 


19.734 


UTERU20083020 


0 


7.182 


0 


0 


UTERU201 02260 


0 


0 


0 


23.706 


UTERU201 32620 


0 


48.06 


0 


0 



Table 5 



Clone ID 


BEAST 


TBAES 


CTONG20070780 


0 


97.283 


CTONG20084660 


0 


89.108 


HLUNG20045340 


0 


85.362 
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Table 5 (continued) 



Clone ID 


BEAST 


TBAES 


TESTI20047~370 


0 


71.55 



Table 6 



Clone ID 


CERVX 


TCERX 


SMINT20030740 


0 


65.795 


Table 7 


Clone ID 


COLON 


TCOLN 


UTERU2004037 


68.149 


0 


Table 8 


Clone ID 


NESOP 


TESOP 


HLUNG20015180 


51.695 


0 


NESOP20004520 


100 


0 


NESOP20005040 


100 


0 


TESOP1 0000350 


0 


86.127 


TESOP1 0001 600 


0 


100 


THYMU20071120 


0 


81.712 



Table 9 



Clone ID 


KIDNE 


TKIDN 


ASTR020009140 


19.518 


0 


ASTR020027330 


0 


30.903 


ASTR020055930 


0 


36.981 


BGGI120010750 


4.532 


0 


BNGH420074600 


0 


11.358 


BRACE20050870 


0 


15.884 


BRACE20054480 


29.719 


0 


BRACE20062580 


2.613 


0 


BRACE20219360 


59.494 


0 


BRAMY20003540 


4.676 


0 


BRAMY20003880 


16.882 


41 .527 


BRAMY20043630 


3.51 


0 


BRAMY20055760 


0 


65.196 


BRAMY201 25360 


0 


32.672 


BRAMY201 90550 


0 


19.772 


BRAMY20204270 


3.618 


0 


BRAMY20227860 


0 


4.255 


BRAWH20014590 


0 


23.509 


BRAWH20093070 


0 


14.759 


BRHIP1 0001 040 


0 


5.496 


CTONG20033750 


59.93 


0 


CTONG20039370 


59.93 


0 


CTONG20045500 


59.93 


0 
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Table 9 


Icontinued) 




Clone ID 


KIDNE 


TKIDN 


CTONG20079590 


0 


17.392 


FCBBF20023490 


14.59 


o 


FCBBF30004340 

1 w?LJL*H WWw~W~w 


o 


10.386 


FCBBF301 06950 


0 


69 888 

w W ■ WWW 1 


FCBBF301 15230 


0 


69 888 

W \J * W W Kj 


FCBBF30 169280 


0 


38.857 


FCBBF30225930 


o 


50 898 


FCBBF30282020 


o 


53.714 


FEBRA20038330 

] L.Lf 1 w **/w www W 


o 


6.538 


FEBRA20039260 


7.084 


o 


FEBRA20040290 


7.711 


o 


FEBRA20082660 


0 


77.359 


FEBRA20121200 


o 


63.078 


FEBRA20 170240 

1 W 1 / Wt_~TW 


o 


63.078 


HEART1 0001490 


20.269 


o 


HLUNG20041590 


4 744 


o 


HI UNGPOOfiRlPO 


11 R4 


u 


HLUNG20072450 


3.599 


o 


HI UNGP0OR14RO 


0 
u 


1<> 004 


HLUNGP00R1Q60 


1fl Q4fi 


u 


KIDNFP0011600 


f uu 


u 


KIDNE2Q016360 




o 




100 

1 UU 


u 


KIDNIFP00P7QR0 

f\ 1 L/ 1 N CxUUt / 07OU 


100 
1 uu 


n 

U 


KiDlvJFPOOROfiQO 


5.861 


o 


KinNIFP00R1170 


1 00 


u 


KIDNFP00R11 RO 


1 00 
1 uu 


u 


KIDNF200836P0 


100 


u 


KIDNF200R4010 


40 01 


u 


KinNFPO0R404o 


14 0R4 


u 


KIDNFP00R4710 


100 
1 uu 


u 


kidnfpoor4roo 


1 00 
1 uu 


KJ 


KIHNFPOORfidQO 


R7 fi1 


u 


KIDNF2G0R66R0 


47 013 


n 
u 


KIDNFP00R6Q70 


100 
1 uu 


u 


KIDNFP0087R80 


28.683 


u 


KIDNE20088P40 


100 


u 


KIDNE2008Q870 


3 987 


o 


KIDNF200Q10Q0 


100 


n 

u 


KIDNE20094260 


100 


o 


KIDNF20094670 


59.494 


n 
u 


KIDNE20095530 


100 


o 


KIDNE201 33460 


100 


o 


KIDNE201 33880 


100 


0 


KIDNE20134130 


65.363 


0 


KIDNE201 34890 


100 


0 


KIDNE201 37310 


100 


0 


KIDNE201 38450 


38.971 


0 


KIDNE201 40870 


22.93 


0 


KIDNE20141120 


100 


0 
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Table 9 


(continued) 




Clone ID 


KIDNE 


TKIDN 


KIDNE20141700 


100 


o 


KIDNE201 42680 


100 


o 


KIDNE201 42680 


100 


o 


KIDNE20142900 


31 .732 


o 


KIDNE201 43200 


100 


o 


KIDNE20147170 


100 


n 

u 


KIDNE20148080 


100 


o 


KIDNE201 49780 


60 365 


o 


KIDNE201 50730 


100 


o 


KID NE20 152440 


100 


o 


KIDNE201 54330 


100 


n 
u 


KIDNE20 154830 


100 


u 


KIDNF201 55980 


100 


ft 
u 


KIDNFP0157100 


100 


ft 
u 


KIDNF201 60^80 


100 

1 uu 


ft 
u 


KIDNE201 60960 


100 


o 


KIDNFPG16371 0 


100 
i uu 


ft 
u 


KIDNE201 65390 


100 


ft 
u 


KIDNFP016Q1R0 


100 


ft 
u 


KIDNFP0 170400 


1Q <^R 


ft 
u 


KIDNIFP01731R0 


100 
i uu 


ft 
u 


KIDNIFP0 173430 


JO.U 1 o 


ft 
u 


KIDNIFP01 7fimn 


1 00 


ft 
u 




1 00 
1 uu 


ft 
u 


KIDNFP01RPR40 

i\ I 1 ^ C_/_ \J I O*10*+U 


1 00 
I uu 


ft 
u 


KIDNFP01RR170 


1 00 
1 uu 


ft 
u 


KIDNFP01 RRR10 
rv i l/ i n lc_u i oouou 


1 0O 
I uu 


ft 
u 


KIDNFP018QRQ0 


100 
1 uu 


ft 
u 


KIHNFP01 RQQRO 


1 00 
I uu 


ft 
u 


KIDNFP01Q1870 

1 \ II-/ 1 tI I — C—\J 1 v7 1 U / U 


100 
1 uu 


ft 
u 


MFSANP003fl c P0 


ft 
u 


11 ^Rfl 

I I .030 


MFSANPG041 ^80 


ft 
u 


CO ROC 


OCRRFP001 fi^QO 


ft 
u 


P°, P11 


OCRRFP01 4PPQ0 


o 


R RP^ 


OCRRFP01 748Q0 


^P P41 


ft 
u 


PLACF60061370 


u 


P1 QR1 


PLACER 0073090 


10.501 


ft 

u 


PLACF601 81 R70 


4Q QP1 


ft 
u 


PROST2001 6760 


3.992 


Q RP 


PUAEN 10000650 


6.676 


o 


SMINT20039050 


o 


11 .102 


SMINT20089210 


o 


12.885 


SPL FN20017610 


42.429 


o 


SPLEN20024930 


o 1 


22.541 


SPLEN20057830 


0 


35 


SPLEN20063250 


3.996 


0 


SPLEN201 26110 


50.05 


0 


SPLEN201 35030 


31 .695 


0 


SPLEN201 36700 


0 


39.807 


TESTI20070740 


0 


75.118 
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10 



Table 9 


continued) 


Clone ID 


KIDNE 


TKIDN 




Ah oft 


U 


TH YM U2000Q500 


0 


24.928 


THYMU20019260 


0 


36.869 


THYMU201 57620 


0 


19.973 


TKIDN 10000620 


0 


100 


TKIDN1 0001 710 


0 


100 


TKIDN1 0001 920 


0 


36.869 


TRACH20011010 


10.647 


0 


UMVEN10001380 


0 


4.196 



15 

Table 10 



20 



25 



30 



35 



40 



45 



Clonp ID 


LIVFR 


Tl l\/P 


CTONG20069320 


20.862 


0 


FCBBF30236670 


3.257 


0 


FEBRA20038220 


77.547 


0 


FEBRA20039260 


8.806 


0 


KinNF?00fl7flfl0 


71 .317 


ft 


LIVER20006260 


100 


0 


LIVER20007690 


85.922 


0 


LIVER20007750 


100 


0 


LIVER20010510 


100 


0 


LIVER20010760 


100 


0 


LIVER20010990 


100 


0 


LIVER20011640 


100 


0 


LIVER20013890 


100 


0 


LIVER20026440 


100 


0 


LIVER20030650 


100 


0 


LIVER20032340 


100 


0 


LIVER20038000 


100 


0 


LIVER20040740 


100 


0 


LIVER20055270 


100 


0 


MESAN20027240 


21.684 


0 


NT2RI20021200 


16.028 


0 


SKMUS20006790 


6.063 


0 


TESTI20035330 


22.045 


0 


THYMU 10004280 


8.787 


4.346 


THYMU20029830 


31 .692 


0 



Table 11 



50 



Clone ID 


HLUNG 


TLUNG 


HLUNG20052300 


23.611 


0 


SMINT20035050 


6.135 


0 


HLUNG20041590 


10.207 


0 


PROST20016760 


8.589 


0 


BRAMY20043630 


15.102 


0 


HLUNG20015180 


4.723 


0 


THYMU20139160 


1.939 


0 
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Table 11 


(continued) 






Clone ID 


HLUNG 


TLUNG 




hi i JNra>nn?nR^n 


fi7 dfifl 


n I 


5 


HI I J Nfi?nrn 94 RO, 




n 
U 




RRAMY9n?f)497n 

DnnlVI I iUcUHt / V 


7 785 


n 




RRAMYPnnm 




n 




RRAMY9n?97ftfif) 

Dit/MVI T C.\)tLC. 1 ODU 


1 .HOO 


n 
u 


10 


nTONnpnn^Qn^n 


oa cn4 


n 
U 


nTONin^m fiftd.fi n 

w 1 UIMUtU 1 OOH-OU 


7fi 9Q1 


n 






fil fi7 
0 1.0/ 


n 
U 




FFRRApnnQQOfin 


7 ft9 
/ .0^ 


n 
U 




FPRRA9nn7RRnn 

rCDnntUU/ oouu 


OO.DOO 


n 
u 


15 


rCDnMZU 1 OoyoU 


oo 007 


n 
U 




nonuiN^uuuuo/u 


oq oqo 
do.doo 


n 
U 




hi i iNf^onnna/ifin 

nLUINo/LUUU040U 


K7 

0/ .04 


n 
U 




HI 1 INIfiOnnnQOftn 


1 nn 
1 uu 


n 
U 




HI I iMnonnnQKKn 

n LU IN O/lUUUyOOU 


1 nn 


n 
U 


20 


wi MNf^onm n-i Qn 

nLUINCfccUU 1 U 1 oU 


i nn 
l UU 


n 
U 




Hi 1 iwr^onnnofin 

nLUINo<£UU I I^OU 


i nn 
i UU 


n 
U 




n LU IN Li^UU 1 1 44U 


l UU 


U 




hi I iwnonnn/ifin 
nLUiNo^uui l 4bU 


7fi C77 
/O.O/ / 


n 
U 


25 


nLUlNV3^UUl ^ I 4U 


^ nn 
I UU 


U 




wi I iMfiOfifM /ik on 
nLuiNLa^iuui 4oyu 


Oft rt/lK 

OO.U40 


U 




wi I iM^onni Rnvn 

rlLUiNva^UUl uU/U 


1 7 on/i 


U 




wi i iMf'ionnonKnn 

rlLUINiJ^UU^UOUU 


•i nn 
1 UU 


U 




wi i iMf^onno-MKn 


eo nnc 
bo.UUb 


U 


30 


wi 1 iMfionnoQnQn 

rlLUIN vj^UU^OUijU 


■< nn 
1 UU 


U 




nLUiNv3^UU<£4U0U 


h nn 
l UU 


U 




nLUNo^UU^ODiiU 


^ nn 
I UU 


U 




r1LUlNLi*iUU^ol 1 U 


/O.Dl O 


U 


35 


wi I iMf'ionnoo/ion 
n lu in vazu u^: y 4 <iu 


^ nn 
1 UU 


0 




nLUlNo^UU*iy4yu 


Ol.l/d 


U 




wi I iMf^onnQn/ion 
n LU In ojiUUoU4£U 


i nn 
1 UU 


U 




riLuiNo^uuou4yu 


1 nn 
l UU 


n 
U 




wi i iMfzonn^nftin 

nLUlNO^UUoUO 1 u 


1 nn 
1 UU 


U 


40 


WI 1 IMfiOOftQI ft9n 
riLUlNOilUUo 1 b^U 


Qn OQ7 


U 




n LU IN o^UUooU bU 


ob.o^y 


u 




WI 1 IMf'OnnQQQin 
M LU IN lazlUUOOO 1 U 


1 nn 
1 UU 


U 




riLUiNo^uuoooou 


^ nn 
i UU 


U 


45 


wi I iMfionnQ/io^n 
n lu in o^uuo4y / u 


7Q QylQ 

/ y.o4y 


U 


wi i iMf^onnQ?-! /in 
nLUINo^UUo/ 1 4U 


1 nn 
I UU 


U 




hi I iM^9nn<i7i fin 


1 nn 

1 UU 


n 
U 




hi i iMf^9nn^77fln 

rlLUINO^UUO/ / OU 


4/1 7ft 1 
44. / O I 


n 
u 




wi i iM^onn<iftQQn 

n LU IN lailUUO DO o u 


i nn 
1 UU 


n 
U 


50 


r1LUINo^UU41 04U 


1 nn 
I uu 


n 

u 




HLUNG20042730 


100 


0 




HLUNG20045340 


7.67 


0 




HLUNG20047070 


100 


0 


55 


HLUNG20050760 


100 


0 


HLUNG20051330 


100 


0 




HLUNG20054790 


100 


0 




HLUNG20055240 


100 


0 
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Table 11 (continued) 



10 



15 



20 



25 



30 



35 



40 



45 



Clone ID 


HLUNG 


TLUNG 


HLUNG20056560 


75.961 


o 


HLUNG20057380 


100 


0 


HLUNG20059240 


100 


o 


HLUNG20060670 


100 


o 


HLUNG20063700 


100 


o 


HLUNG20065700 


62.8 


o 


HI UNG20065QQ0 


100 


n 

u 


HLUNG20067810 


100 


o 


HLUNG20Q68120 

1 1 ^V** 1 « V_4^wwW 1 £- w 


50.947 


o 


HLUNG20069350 


100 


o 


HLUNG20070410 


100 


o 


HLUNGP007P1 00 


54.241 


o 


HI UNG200721 Q0 


79 349 


n 
u 


HLUNGP0G7P450 


7 744 
/ . / *+*+ 


ft 
u 


HI LJNG20074' : i^0 


100 


ft 
u 


HI UNGP007Q^10 


1 00 


ft 
u 


HI LJNGP0081 'SQO 


fifi 4PQ 


ft 
u 


HI UNftPOOSI R*30 


1 00 


ft 
u 


HI UNftPOOSP^RO 


1 00 


ft 
u 


HI UNCiPOOS'^IO 


1 00 
1 uu 


ft 
u 


HLUNGP008^480 


I O . 1 C~<J 


ft 
u 


HLUNGP0083840 


1 00 
1 uu 


ft 
u 


HI I INfiPOOS^QRO 

n L.KJ 1 ii UU OO 3 ou 


40 78 


ft 
u 


HI I INftP00847Q0 


1 00 
I uu 


ft 
u 


HI UNfiP0085P10 


*^0 QQ<* 


ft 


HI UNftP00887R0 


1 00 
I uu 


ft 
u 


HI UNfiPOOQPR^O 


100 
I uu 


ft 
u 


HI LJNGPOOQ^mO 


100 
1 uu 


ft 
u 


HI LJNfiP00Q41 ^0 

r l 1 N VJ£_UUc7H 1 ou 


7C QQ7 


ft 
u 




88 P88 


ft 
u 


PRO ^TPOORPft^O 


57 701 


ft 


<skn winpooofi^o 


P 1^4 


ft 
u 


9PI FNP001P4R0 




ft 
u 


TFRTIP00R75Q0 


17.804 


ft 
u 


TESTI20061200 


29.123 


0 


TESTI20067480 


18.856 


0 


TESTI201 16050 


30.168 


0 


THYMU1 0004280 


7.603 


0 


THYMU20010180 


79.349 


0 


TRACH20011010 


22.907 


0 


UTERU20016580 


43.64 


0 


UTERU20 127030 


66.318 


0 



Table 12 





Clone ID 


NOVAR 


TOVAR 


55 


KIDNE20089870 


91.868 


0 




NT2RP70075300 


76.633 


0 




TESTI201 32310 


94.177 


0 
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Table 13 



Clone ID 


STOMA 


TSTOM 


BNGH420087430 


0 


91.629 


BRAMY20227860 


1.35 


0 


BRAWH20027250 


42.096 


0 


CTONG201 74440 


26.346 


0 


FEBRA20090220 


4.442 


0 


PUAEN 10000650 


13.031 


0 


SMINT20023110 


72.094 


0 


SMINT20030740 


5.514 


0 


SMINT20045890 


34.092 


0 


SPLEN20048800 


2.011 


0 


SPLEN201 39360 


79.641 


0 


TESTI20063410 


28.273 


0 


TESTI201 50920 


33.158 


0 


TRACH20026640 


21.272 


0 


UTERU20041970 


0 


72.886 



Table 14 



Clone ID 


UTERU 


TUTER 


MUnbLtUUtUtyU 


£. \ .OOO 


U 


BRACE20038920 


10.185 


0 


BRAMY20091230 


39.224 


o 


BRAMY20093490 


62.465 


I ° 


BRAMY20227860 


2.268 


0 


BRHIP20005060 


61.644 


0 


CTONG20069320 


18.336 


0 


CTONG20083430 


62.039 


0 


FCBBF30005360 


41 .409 


0 


FCBBF30257370 


21.099 


0 


FEBRA20038330 


5.808 


0 


i FEBRA20039260 


7.74 


0 


! FEBRA20040260 


43.146 


0 


I FEBRA20078180 


31 .447 


0 


FEBRA20087550 


10.505 


0 


HLUNG20015070 


9.042 


0 


HLUNG20015180 


7.196 


0 


1 MESAN20007110 


50.674 


0 


! MESAN20067430 


20.114 


0 


! MESAN20095800 


14.162 


0 


NT2RP70057500 


14.698 


. 0 


SKMUS20008730 


2.775 


0 


SKNMC20006350 


1.084 


0 


SMINT20035050 
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35 



Clone ID 


FEHRT 


HEART 


BRAMY20043630 
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7.465 


BRAMY20072870 


0 


76.411 


BRAMY20227860 


0 


2.943 


BRAWH20093070 


0 


25.522 


BRCAN 10001 680 


0 


14.799 


FCBBF30053300 


86.185 
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FEBRA20078800 


0 


33.301 


FEBRA20090220 


0 


9.681 


HCHON20000870 


0 


23.022 


HEART1 0001420 
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14.37 
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0 


23.745 
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0 


67.646 
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0 


48.135 
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0 


5.186 


SKMUS20008730 


0 


27.003 


TESTI20270130 


0 


83.925 



Table 38 
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55 



Clone ID 


FEKID 


KIDNE 
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0 
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0 
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0 


29.719 
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0 


2.613 


BRACE20219360 


0 


59.494 


BRAMY20001510 


68.103 


0 


BRAMY20003540 


0 


4.676 


BRAMY20003880 


0 


16.882 


BRAMY20043630 


0 


3.51 


BRAMY20204270 


0 


3.618 


CTONG20033750 


0 


59.93 


CTONG20039370 


0 


59.93 


CTONG20045500 


0 


59.93 


FCBBF20023490 


0 


14.59 


FEBRA20039260 


0 


7.084 


FEBRA20040290 


0 


7.711 
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(continued) 




Clone ID 
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(continued) 
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rJU!N C.£.KJ I o^OfU 


U 


i nn 
I UU 


1 k'lnMPOHl BR1 7H 
PuUIN C*iU I OD 1 /U 


a 
U 


i nn 
1 UU 


j rviuiN t^lU 1 OOOOU 


U 


1 nn 
l UU 


! kinNPom Ronan 


u 


i nn 
l UU 


i^inMPom AQOAO 
! r\iurMci£u i oyyou 


U 


H AA 
1 UU 


1 r\iuiNC£Ui yi o/ u 


u 


H AA 

lUU 


Of^RRPOni 7i»lQO.n 

uoDDr*iUi /4oyu 


a 
U 


qo o>M 




A 

u 


1 U.oU i 


rLMOtOUl olo/U 


a 
U 


/in ooi 


rnUo 1 ^UU l o/ bU 


A 

U 


o ooo 


rU Atrsii UUUUboU 


A 

u 


6.676 


Gi^MMf^onnneocn 


*£4.4^y 


0 


orLclVI^UU 1 /o l U 


A 
0 


42.429 


CDI CNIOAACQOCA 


A 
U 


3.996 


SPLEN20126110 


0 


50.05 


SPLEN201 35030 


0 


31.695 


TESTI20061200 


0 


13.537 


TESTI20262150 


0 


40.96 


THYMU1 0004280 


0 


3.534 


THYMU20139160 


0 


2.704 


TRACH20011010 


0 


10.647 



Table 39 



Clone ID 


FELNG 


HLUNG 


BRAMY20001510 


0 


5.948 


BRAMY20043630 


0 


15.102 


BRAMY20204270 


0 


7.785 


BRAMY20227860 


0 


1.488 


CTONG20029030 


0 


28.504 


CTONG201 68460 


0 


76.291 


CTONG201 86290 


0 


61.67 


FEBRA20039260 


0 


7.62 


FEBRA20078800 


0 


33.686 


FEBRA201 63980 


0 


38.327 


HCHON20000870 


0 


23.288 


HLUNG20008460 


0 


67.54 


HLUNG20009260 


0 


100 


HLUNG20009550 


0 


100 


HLUNG20010130 


0 


100 


HLUNG20011260 


0 


100 
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Table 39 


(continued) 




Clone ID 


FELNG 


HLUNG 


HLUNG20011440 


o 


1 nn ' 

1 uu 


HI UNGP0011460 


o 


7fi R77 


HI UNGP001P140 


u 


mn 

I UU 


HLUNG20014590 


o 


36.045 


HI UNGP0015070 


0 
u 


1 7 804. 

1 / .OU4 


HLUNGPQ015180 


n 
u 




HLUNGP00P0500 


n 
u 


mn 

1 uu 


HLUNGP00P0850 


n 
u 


R7 488 

O/ .HOO 


1 1 1_ LJ 1 >i VJJil U U*- I4UU 


n 

u 


R8 nnR 

OO.UUO 


hi UNGPnnp^n^n 


a 

u 


1 nn 
I uu 


hi UNGpnnp4n^n 


n 
u 


1 nn 
l uu 


HI I JNGPfinPRRPfi 


n 
u 


1 nn 
l uu 


hi UNGpnnpflun 


n 
u 


7R R1 ft 
/ O.O l O 


HI I JNGPnnPQ4Pfi 


a 
U 


1 nn 

I uu 


HI UNGP0nPQ4Qn 


u 


fl1 1 7Q 


hi i iNGpnnin4pn 


u 


1 nn 
1 uu 


hi i iNGpnn^n4Qn 


n 
u 


1 nn 
i uu 


hi i iNGpnn^nfiifi 

nLUINutUUOUD 1 U 


u 


1 nn 
1 uu 


hi i iMfisnnQi Ron 


u 


OA OQ7 


Hi i iMf^onnQo^ftn 

nLUI\lo£UUo^40U 


A 

u 


>M AQ7 
44. Uo/ 


hi I iMf^onnQ^nftn 


a 
u 


oe coo 


hi I iNGPnnn^m 


u 


1 nn 

l UU 


hi I iNf^onn^Q^n 

n LUINO^UUOOOOU 


u 


i nn 
1 UU 


hi I !M^pnnQ4Q7n 
nnji\io^uuo4y/u 


u 


70 Q/IO 


hi NN^pnn^7iAn 


u 


i nn 
1 UU 


HI I iNfiPnn^7i Rn 


u 


i nn 
1 UU 


hi i iM^pnn^77Rn 


u 


/lil 7ft i 
44. 1 0 1 


hi t iNif^pnrnft'Wi 


A 

u 


■i nn 

1 UU 


ni_UI\lJ^UU4 1 D4U 


A 

u 


4 AA 

1 UU 


hi 1 iNir;onn4i Ron 


A 

u 


A a OA7 
1 U.^iU/ 


hi i iNifionn407'5n 


A 

u 


i nn 
lUU 


hi i iMripnn^RQAn 

n LU IN U U4004U 


A 

u 


7 R7 
/.Of 


hi i iMnpnn47n7n 


A 

u 


1 nn 
l uu 


hi i iMr;pnriRn7Rn 


A 
U 


1 nn 
1 uu 


hi i iNnonnRi Q^n 


A 
U 


i nn 
1 UU 


hi i iMripnnRPQnn 


A 
U 


OQ £1 1 


hi i iNf^9nnRA7on 
ni_ui\io^uuo4/yu 


A 
U 


i nn 
1 uu 


hi i iNfionnRRO/in 
n i_ u i n vat uuoot*f u 


A 
U 


1 nn 
1 UU 


HI 1 INGPHHRRRRO 
ni_ui\o^uuoooou 


A 
U 


7R Q«1 

/o.yo i 


hi i JNGpnnR7^ftn 


A 
U 


1 nn 
I uu 


hi i JWGpnnRQPdn 


A 
U 


1 nn 
I uu 


hi i JNGpnnRDR7n 


A 
U 


1 nn 
I uu 


hi i JWGpnriR?7nn 


A 
U 


mn 

I uu 


HLUNG20065700 


0 


62.8 


HLUNG20065990 


0 


100 


HLUNG20067810 


0 


100 


HLUNG20068120 


0 


50.947 


HLUNG20069350 


0 


100 


HLUNG20070410 


0 


100 


HLUNG20072100 


0 


54.241 
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Table 39 


(continued) 




Clonp in 

V-»l \JI IC 1 \J 


FELNG 


HLUNG 


Mi i iM^9AA7bi on 


A 
\J 


7Q Q>1<5 

/y.o4y 


n L U IN \Jt£ U U / £40U 


A 
U 


7 7vM 
/ 44 


mi I iMfionn7/!QQn 
nLUl\lo*£UU/4ooU 


U 


1 AA 
I UU 


Ml I iMfionn70Qi n 
riLUiMva^uu/ yoi u 


u 


"I AA 

I UU 


mi i im^qaahi qoa 
nLUI\o<iUUo i oyu 


a 

u 


oo.4^y 


nLUlNO^UUol DoU 


a 
U 


^ AA 

1 UU 


nLUI\Jo£.UUo2ooU 


a 
U 


H AA 

1 UU 


mi I iMfiOAAQoooA 
nLUlNu^UUooooU 


a 
U 


H AA 

1 UU 


Ml 1 IMl'iOAAQQ/IQA 

nLUI\JLa2UUoo4oU 


a 
U 


io.12o 


Ml 1 IMfiOAAQOO/in 

nLUr\ILj2UUooo4U 


a 
0 


H AA 
1 UU 


Ml 1 IM/^OAAQQQRA 


A 
U 


4U./0 


ui i iMPonfiQ/t7on 


0 


lOQ 




0 


CA AAO 

50.993 


nLUiNvj^iUUoo/oO 


0 


1 00 


LJI 1 IM^OnrtOOCOft 

nLUNvj^UUy^ooO 


0 


4 AA 
1 00 


nLUiNo^UUyoUoO 


0 


1 00 


nl_Ul\J(j2UUy4 1 oU 


0 


75.987 


ftl UIN b^Ul 4^yuu 


0 


0O.2DO 


rnUb 1 2UU l D/60 


0 


8.589 


DoncTooAcoocn 
rnUb 1 200o2o5U 


0 


57.701 


oi/AiMf'OAAAeocA 


0 


2.134 


CfkAlMTOAAOCACA 
oMIfM 1 200ob050 


0 


6.135 


brl_hN200 12450 


0 


25.695 


TESTI20057590 


0 


17.804 


1 Lb 1 120061200 


0 


29.123 


1 bb 1 1200b/48Q 


0 


18.856 


TESTI201 16050 


0 


30.168 


THYMU 10004280 


0 


7.603 


THYMU20010180 


0 


79.349 


THYMU20139160 


0 


1.939 


TRACH20011010 


0 


22.907 


UTERU20016580 


0 


43.64 


UTERU201 27030 


0 


66.318 



Table 40 

Alteration of the expression level of each clone due to TNF-a stimulation to human monocyte cell line THP-1 
and alteration of the expression level of each clone due to co-culture of gastric cancer cell line MKN45 with Helico- 
bacter pylori, ctl, TNF_1 h, and TNF_3h in the column of THP-1 , respectively, indicate the relative mRNA expression 
levels in unstimulated THP-1 , in the cell stimulated with 10 ng/mL TNF-a for 1 hour, and in the cell stimulated with 
10 ng/mL TNF-a for 3 hours; ctl, Hp, and AcagE in the column of MKN45 indicate the relative mRNA expression 
levels in MKN45 cultured without Helicobacter pylori, in the cells co-cultured with cag PAI-positive Helicobacter 
pylori (TN2) (at a ratio of MKN45 : TN2 = 1 :100 cells (colonies)) for 3 hours, and in the cells co-cultured with the 
cagE mutant (TN2AcagE) (at a ratio of MKN45 : TN2AcagE = 1:100 cells (colonies)) for 3 hours, respectively. 
[ATAC-PCR] 



Clone name 


THP-1 


MKN45 




Ctl 


TNF_1h 


TNF_3h 


ctl 


Hp 


AcagE 


ASTR020045840 


1.5 


2.3 


1.9 


2.3 


2.4 


0.2 


ASTR020055930 


0.8 


1.9 


1.4 


0.8 


0.8 


0.5 
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Table 40 (continued) 



5 



10 



15 



20 



Clone name 


THP-1 


MKN45 


ctl 


TNF_1h 


TNF_3h 


ctl 


Hp 


AcagE 


a 9TRnpnnftflQ*^n 


1.0 


0.2 


2.5 


1.1 


0.3 


0.3 




2.2 


2.2 


0.0 


0.5 


0.5 


3.2 




2.6 


1.0 


0.3 


2.2 


1.0 


0.9 


RRAHFpnnRdnftn 

□ nnwLcUUOtuou 








0.8 


1.1 


1.0 


RRAMYPOOOIRRO. 


1.5 


0.9 


0.6 


1.2 


0.0 


1.3 


RRAMY^nn97^Qn 


0.6 


4.2 


0.1 


2.9 


0.4 


0.1 


D nMIVI T <£UU<lOOoU 








0.5 


3.4 


4.1 


RRAMYpnn^R^ftn 

D nMIVI T iUUjOJOU 


1.3 


0.9 


0.8 


0.5 


1.7 


0.5 


D riMIVI T cUUuQOJu 








1.1 


0.3 


0.3 


RRAMY9nn*=inQ4.n 


0.4 


0.0 


0.0 


0.2 


0.2 


1.0 


RRAMYpnn7Pddn 








0.5 


0.5 


0.1 


RRAMVonnoco^fi 

DHMIVI T ^UUaQaJU 


3.8 


5.2 


4.7 


1.7 


2.4 


1.7 


RRAMVOfM 1 R41 0 
DnMIVI T £U 1 1 0*1 1 u 


0.8 


3.7 


0.8 


2.7 


0.0 


0.0 


do Afc/ivonoQ7i on 


0.0 


2.3 


0.1 








DrtMVVn^UUOOooU 


2.5 


4.7 


2.8 


1.2 


1.3 


0.0 


DnMvvn^uu/ oo^u 








1.9 


1.2 


1.0 


DnnVVncU I cjUOOU 


0.6 


0.1 


0.0 


0.3 


0.0 


0.0 


RRfAMOfinni/iQn 

D nL/MIN^UUU 1 *H3U 


1.0 


3.4 


4.1 


1.4 


1.6 


0.6 


rrmi D1 nnnn7on 

Dnnlr I UUUU/^U 


0.3 


1.8 


1.4 


0.9 


0.0 


1.7 


rrmipi nnm n^n 
Dnnlr i uuu I U4U 


0.9 


0.7 


0.1 


0.0 


0.1 


0.0 


R QUI I DonnnAO i A 
D nn I r^UUUU^ 1 U 


0.6 


0.6 


0.0 


2.0 


0.3 


0.0 


RRQQMonnni 07H 


0.8 


1.4 


1.3 


0.8 


0.7 


0.5 


RRQCMonnonon 
DnooiNtuuyi i yu 








0.6 


0.1 


0.8 


UUo4L/^UUU 1 /oU 


0.0 


0.4 


2.0 








PTn M I'i O A fi 7Q Q/lfl 


0.3 


2.6 


1.6 


0.9 


0.7 


2.4 


u i or>io<:uu/yoyu 


1.0 


1.2 


0.2 


0.1 


0.0 


0.0 


L» 1 UlNui^UUooyoU 


0.0 


0.0 


0.0 


1.3 


1.8 


1.9 


O 1 vJINta^UUoo^i l U 


0.8 


1.2 


2.3 


0.1 


0.2 


0.1 


U r l\l t U U oo*k) u 


1.7 


3.6 


2.7 


1.3 


2.0 


0.1 


UrNbo/cuu/^yyU 


1.4 


1.9 


2.0 


5.0 


4.9 


4.2 




1.8 


5.5 


3.8 


2.3 


2.2 


2.3 




0.1 


0.1 


0.0 


1.7 


0.5 


0.5 


rODDr<iUUooooU 


0.6 


0.7 ! 


0.4 


0.4 


0.2 


0.2 


pr^ r r CQnnoo r r n 
roDBroUU^^ooU 


2.9 


1.0 


0.3 


2.9 


1.0 


0.3 


rODDroUU/ oouu 


1.1 


2.6 


0.6 








ruDBrdUl UouoU 


1.8 


1.6 


1.9 


0.2 


0.1 


0.0 


Pf^RRPQfH ftQQ7n 

rL/DDroUl oyo/U 


1.1 


1.2 


0.2 


1.6 


0.3 


0,3 


Ff^RRPQOOORO^n 


2.2 


0.8 


1.0 


1.1 


0.7 


0.2 


rODDrOUUUUu 1 U 


2.1 


2.8 


2.2 


2.0 


2.2 


1.3 


rCDn/icUUU/OtU 


0.0 


1.7 


2.4 


2.1 


1.4 


1.2 


FEBRA20031280 


0.1 


1.8 


4.5 


0.5 


0.0 


0.0 


FEBRA20031810 


1.4 


3.9 


3.5 


1.5 


2.1 


1.9 


FEBRA20039260 


2.0 


3.0 


2.5 








FEBRA20046280 


1.3 


0.3 


0.3 








FEBRA20084750 








2.5 


2.2 


0.3 


FEBRA201 82030 


3.0 


4.0 


4.2 


1.6 


0.3 


0.7 


HLUNG20041540 


0.0 


2.2 


2.2 


1.9 


2.4 


0.2 


HLUNG20092530 


0.3 


0.3 


3.1 


0.2 


0.2 


0.9 
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Table 40 (continued) 



Clone name 


THP-1 


MKN45 


ctl 


TNF_1h 


TNF_3h 


ctl 


Hp 


AcagE 


KIDNF20084030 


1.6 


0.1 


0.3 


0.1 


0.0 


0.0 




0.6 


0.3 


0.0 


0.5 


0.5 


1.1 


KIDNE20134130 


0.4 


0.4 


0.5 


2.3 


1.2 


1.6 


KIDNF20 182540 








1.1 


0.3 


0.3 


KIDNE20186170 


0.0 


0.0 


0.0 


0.6 


0.0 


0.0 


rviLvix i_<_u i oouou 


1.5 


0.5 


0.5 


0.4 


0.3 


0.0 


I iVFR?nnn77^n 

LI VLncUUU/ / OU 


1.3 


1.8 


0.3 


1.9 


0.7 


0.0 


M AW?nn21 92fl 

I VI LIOrAI ^UU£ 1 ^^U 


1.7 


2.6 


1.6 


2.3 


0.9 


0.3 


IVI LOAAI M£UUDH 1 JU 


0.8 


2.6 


2.2 


1.7 


2.1 


1.1 


IN 1 C\ N L^£.UUOC7^ 1 U 








1.4 


0.4 


0.1 


NT2NFP00821 30 

IN I ClitLUUOL IOU 


1.8 


1.3 


0.5 


1.9 


0.3 


0.3 


IN 1 £.IN C*lUUi7<Si70U 


1.3 


2.7 


3.4 


1.7 


2.4 


1.8 


NTPRP7nn3i 070 

IN Itnr / UvO IU/U 


0.3 


0.9 


1.4 


0.4 


0.4 


0.0 




0.3 


0.3 


1.3 


0.9 


0.2 


1.2 


nrRRP9nn noi n 


4.5 


1.8 


0.5 


0.5 


0.5 


4.5 


nnRRF?m 1 n73n 


0.4 


0.5 


0.3 


0.1 1 


0.0 


0.0 




1.1 


1.1 


0.9 


2.5 


0.8 


0.3 


OPRRF901 fifiQnn 


1.7 


4.4 


4.4 


1.9 


3.3 


1.1 


UUDDrtU 1 /UOOU 


0.9 


5.4 


0.3 


0.3 


1.7 


1.5 


OPRRFOni 7fififirt 
UODDr^U I /OOoU 


0.5 


1.3 


0.5 


0.7 


0.7 


0.0 


r LAU COUUUDoUU 


0.8 


3.2 


1.0 


0.5 


0.5 


0.9 


Pi APFfionfii <*7n 


2.7 


0.8 


1.5 


1.2 


1.0 


1.3 


pro^tpooi 1 1 fin 


0.2 


1.7 


2.0 


1.3 


5.5 


1.7 


PPPiQTOnnAI A fin 
rnUo 1 cXiyj** 1 *fOU 


2.9 


0.1 


1.5 


0.6 


0.0 


0.0 


rnuo i *iuuoo i uu 


1.4 


1.9 


s 1.6 


5.3 


0.0 


0.0 


r rlL/o 1 iiUu/OZoU 


1.7 


0.5 


0.5 


2.6 


0.5 


0.5 


DPHQTOftl nftnfin 
r nUo 1 1 UOUOU 


0.2 


0.2 


2.0 


1.0 


0.6 


0.7 


PRnQTom 1 ni on 
rnuo i csj 1 1 u i ^u 


1.2 


0.6 


0.5 








Qk'Mi iQonnoionn 
orviviuo^uuyi yuu 


1.2 


1.6 


0.3 


1.2 


2.6 


0.3 


O IVI UN 1 £\J\J£A 1 **U 








0.0 


0.1 


0.0 


QMiMT^nnooi fin 
oivhin i ^uuy^i ou 


0.5 


0.7 


0.3 


1.3 


1.2 


0.3 


QPI PM*jnn/4n7Qn 
o r Lc l\I^UU*HJ / OU 


1.0 


2.8 


1.9 


0.3 


0.8 


0.1 


OrLCIMtUl 1 UOOU 


2.4 


0.1 


7.6 


1.4 


0.1 


0.1 


^pi FM^ni77Ann 

orLCINtU 1 / / *+UU 


0.8 


3.3 


1.3 


1.3 


0.7 


0.3 


TF^Ti2nn3R94n 








0.1 


0.0 


0.0 


TF ^ti onnAi 1 <*n 

I CO t I*lUU*+0 I ou 


0.0 


0.1 


0.7 








TF ^Tl 9004fifi40 

I CO I I^UUHOO'fU 


1.1 


0.8 


0.2 


1.1 


0.8 


0.3 


TFQTionnA7^7n 

I CO I ltUU4/ o/u 


0.4 


0.4 


0.5 


0.6 


0.0 


0.5 


TFQTionnfi79nn 

I Co I I^UUO/^UU 


2.5 


0.0 


1.2 


1.1 


0.4 


0.3 


TF^TI900fi7fiQ0 

I CO) 1/iUUO/OaU 


0.1 


0.1 


0.0 


3.8 


3.5 


2.2 


TESTI201 13940 


5.2 


0.2 


0.2 


4.4 


0.4 


0.4 


TESTI201 49880 








2.2 


0.2 


2.2 


TESTI20151800 


2.1 


3.3 


2.3 


2.5 


1.0 


0.3 


TESTI201 73050 


0.8 


0.6 


0.5 


1.8 


1.1 


1.0 


TESTI201 98600 








2.2 


0.2 


2.2 


TESTI20257910 


1.2 


0.3 


0.3 


1.2 


0.2 


0.7 


TESTI20262940 


1.5 


1.1 


0.2 


1.1 


1.3 


0.2 


THYMU20046770 








1.7 


0.5 


0.5 
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Table 40 (continued) 



5 



10 



15 



20 



Clone name 


THP-1 


MKN45 


Ctl 


TN F_1 h 


TNF_3h 


ctl 


Hp 


AcagE 


THYMU20058550 








1.9 


0.1 


0.1 


THYMU20062520 


0.0 


0.0 


0.7 


0.2 


0.3 


0.1 


THYMU20062770 








1.6 


1.0 


0.3 


THYMU20078240 


0,3 


1.7 


2.9 


0.0 


1.0 


0.1 


THYMU20150190 


0.2 


0.2 


0.6 


1.4 


0.5 


1.9 


TR ACH201 25620 


1.1 


2.5 


1.5 


1.4 


1.4 


1.0 


TRACH201 49740 


9.4 


9.4 


0.9 


1.6 


2.1 


0.6 


TRACH201 Q0460 


2.0 


3.3 


3.1 


0.2 


1.1 


0.2 


LJTFRU20045200 


1.4 


2.6 


3.9 


0.9 


2.1 


2.5 


UTERU20064120 


0.6 


2.7 


2.0 


0.6 


1.7 


0.4 


UTFRU201 03200 


0.1 


0.0 


0.9 


2.4 


1.7 


0.9 


ADRGI 2004fi7fi0 








0.5 


0.5 


0.5 


ASTROPOOW^O 


0.7 


2.4 


1.7 


0.8 


2.1 


1.6 


RRAMY200761 ^0 


0.1 


1.4 


0.2 


0.1 


0.0 


0.0 


CTONG201 70Q40 


1.6 


0.8 


0.3 


0.5 


9.9 


0.5 


FCBRF200^^R0 


0.1 


0.4 


1.4 


6.4 


0.3 


0.7 


FCRRF^02S7 / ?70 
ruuurduc.0/ o / w 


2.7 


0.3 


0.3 


2.2 


0.1 


2.2 


FCRRF^0001 fi^O 


1.0 


1.6 


1.2 


1.3 


1.0 


0.9 


FFRRA200402QO 


0.4 


1.9 


1.1 


0.4 


1.5 


1.4 


FFRRA200R^72n 


2.9 


3.3 


3.1 


1.5 


0.7 


1.4 




2.5 


2.1 


1.5 


4.2 


0.0 


0.3 


FFRRA201 OflRRO 


1.4 


2.8 


2.8 


0.7 


0.2 


0.2 


M F^ AN20021 RRO 


0.1 


1.2 


0.2 


0.9 


1.0 


0.9 


IVI COnlN^UUO / tOU 


0.8 


3.0 


1.5 


0.0 


0.3 


0.0 




0.7 


0.2 


0.2 


0.8 


0.6 


0.2 


PRHQT9nni fi7fiD 








1.1 


3.4 


2.3 


^k , NQH9nnn7i fin 
or\iMorituuUf iou 


0.6 


0.3 


0.2 


1.1 


0.7 


0.6 


SMiMT?onnfin2n 

OIVIMN 1 cUUUQU^U 


2.5 


1.0 


0.3 


2.4 


0.3 


0.3 


TF^TIPn0^Q^70 


0.0 


0.2 


0.3 


0.0 


0.0 


0.0 


TFQTI901 mfiQO 


0.0 


0.0 


0.0 


0.8 


0.3 


0.3 


1 Co 1 I^U*£0*f*foU 


0.0 


0.1 


0.6 


0.8 


0.6 


0.0 


thymi H nnn49ftn 

I n T iviu I UUUHtOU 


0.2 


2.2 


0.2 


0.2 


1 .1 


1 .4 


THYMU20030460 


0.2 


0.6 


0.2 


0.6 


0.8 


2.4 


TRACH20090060 


0.5 


0.3 


2.0 


0.0 


6.8 


1.7 


UTERU20041970 


1.6 


1.1 


1.5 


0.0 


3.5 


2.4 


BRAMY20 125360 


0.8 


1.6 


1.1 


0.0 


0.0 


0.0 


OCBBF20142290 


0.2 


0.9 


0.3 


0.1 


0.4 


0.0 


SKMUS20006790 


0.7 


0.5 


0.3 


1.0 


0.8 


0.7 


TESTI20030610 


0.1 


2.0 


1.7 


0.0 


0.3 


0.0 


UTERU20026620 


0.4 


4.2 


4.2 


2.4 


2.7 


0.9 



50 

Homology Search Result Data 

[031 3] Data obtained by the homology search for full-length nucleotide sequences and deduced amino acid sequenc- 
es. 

55 [0314] In the result of the search shown below, both units, aa and bp, are used as length units for the sequences to 
be compared. 

[0315] Each data includes Clone name, Definition in hit data, P value, Length of sequence to becompared, Homology, 
and Accession number (No.) of hit data. These items are shown in this order and separated by a double-slash mark, //. 
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ADRGL20020290//Human placental equilibrative nucleoside transporter 1 (hENT1) mRNA, complete cds 7/1 .70E-240// 
456aa//100%//U81375 

ADRGL20021910//Homo sapiens transmembrane protein B7-H2 ICOS ligand mRNA, complete cds.//2.50E-88// 
168aa//100%//AAG01176 

5 ADRGL20022600//DIAPHANOUS PROTEIN HOMOLOG 1 (P140MDIA).//2.00E-07//121aa//36%//008808 
ADRGL20023920//ABC1 PROTEIN HOMOLOG PR ECU RSOR.//1 .40E-98//467aa//45%//Q92338 
ADRGL20026790//Homo sapiens PLIC-1 mRNA, complete cds 7/6.1 0E-05//169aa//31%//AF293384 
ADRGL20027530 
ADRGL20036380 

10 ADRGL20036840//HLA CLASS I HISTOCOMPATIBILITY ANTIGEN, ALPHA CHAIN H 

PRECURSOR (HLA-AR) (HLA-12.4).//7.30E-68//131aa//96%//P01893 

ADRGL20040310 

ADRGL20040770 

ADRGL20046760 
15 ADRGL20047080 

ADRGL20047770 

ADRGL20057560 

ADRGL20059610//GLUCOSYLCERAMIDASE PRECURSOR (EC 3.2.1 .45) (BETA-GLUCOCEREBROSIDASE) (AC- 
ID BETA-GLUCOSIDASE) (D-GLUCOSYL-N-ACYLSPHINGOSINE GLUCOHYDROLASE) (ALGLUCERASE) (IMI- 
20 GLUCERASE).//3.00E-94//1 88aa//93%//P04062 

ADRGL20062330//Homo sapiens trabeculin-alpha mRNA, complete cds7/1 .10E-128//439aa//58%//AF141968 
ADRGL20063770 

ADRGL20066770//elastin microfibril interface located protein [Homo sapiens].//3.00E-31//210aa//44%//NP_008977 
ADRGL20067320 

25 ADRGL20079060//Mus musculus mRNA for Ky protein (muscle-specific protein). //3.80E-266//281 aa//84%//AJ293727 
ADRGL20095330 

ASTRO20001910//Rattus norvegicus mRNA for annexin V-binding protein (ABP-10), partial cds.//2.20E-57//153aa// 
73%//D64062 
ASTRO20003720 
30 ASTRO20004820 

ASTRO20006530//Homo sapiens hookl protein (HOOK1) mRNA, complete cds.//1 .80E-94//383aa//55%//AF044923 
ASTRO20009140//PUTATIVE COMPETENCE-DAMAGE PROTEIN 7/2.70 E-06//167aa//29%//P46323 
ASTRO20010010 
ASTRO20010290 
35 ASTRO20012270 
ASTRO20020240 
ASTRO20020350 
ASTRO20022020 

ASTRO20026320//NAM7 PROTEIN (NONSENSE-MEDIATED MRNA DECAY PROTEIN 1) (UP-FRAMESHIFT SUP- 
40 PRESSOR 1 )7/1 .80E-47//432aa//33%//P30771 
ASTRO20027330 

ASTRO20038400//Homo sapiens zinc finger homeobox protein ZHX1 mRNA, complete cds.//3.40E-78//282aa//42%// 

AF1 06862 

ASTRO20045840 

45 ASTRO20046280//PSU1 PROTEIN7/1 .30E-42//228aa//36%//P53550 
ASTRO20047510 

ASTRO2005081 0//L-RIBULOKINASE (EC 2.7.1 .1 6)7/1 .1 0E-43//51 2aa//30%//P94524 

ASTRO20052420//PROBABLE GUANINE NUCLEOTIDE REGULATORY PROTEIN TIM (ONCOGENE TIM) (P60 
TIM) (TRANSFORMING IMMORTALIZED MAMMARY ONCOGENE).//4.80E-151//408aa//76%//Q12774 
50 ASTRO20053430//BAND 4.1 -LIKE PROTEIN 4 (NBL4 PROTEIN).//1 .80E-58//307aa//38%//057457 
ASTRO20055530 

ASTRO20055570//MAJOR PRION PROTEIN PRECURSOR (PRP) (PRP27-30) (PRP33-35C) (AS C R) 7/5. 40 E -72// 

137aa//100%//P04156 

ASTRO20055930 

55 ASTRO20058960//DNA damage inducible protein homolog - fission yeast (Schizosaccharomyces pombe)//1 .90E-1 All 
205aa//31%//T39541 
ASTRO20069200 

ASTRO20075150//TNF RECEPTOR ASSOCIATED FACTOR 3 (CD40 RECEPTOR ASSOCIATED FACTOR 1) 
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(CRAF1) (CD40 BINDING PROTEIN) (CD40BP) (LMP1 ASSOCIATED PROTEIN) (LAP1).//1 .00E-25//60aa//98%// 
Q13114 

ASTRO20076660 

ASTRO20085080//TIPD PROTEIN.//1 .80E-58//307aa//37%//015736 

5 A STRO20088950//LACTASE- PHLORIZIN HYDROLASE PRECURSOR (LACTASE-GLYCOSYLCERAMIDASE) [IN- 
CLUDES: LACTASE (EC 3.2.1.108); PHLORIZIN HYDROLASE (EC 3.2.1 .62)] 7/7.80 E-85//331aa//48%//P09848 
ASTRO20089600//Mus musculus sacsin gene, complete cds.//1.10E-05//198aa//26%//AF 193557 
ASTRO20090680//M.musculus mRNA for IB3/5-polypeptide7/1 .60E-173//412aa//78%//X79131 
ASTRO20091180 

10 ASTRO20091770 
ASTRO20141740 

BGGI120000670//Rattus norvegicus myosin heavy chain Myr 8b mRNA, complete cds.//1 .660E-05//86aa//36%// 
AY004215 

BGGI120010750//Rattus norvegicus mRNA for SECIS binding protein 2 (sbp2 gene).//7.90E-305//812aa//71%// 
15 AJ251245 

BNGH41 0000570 

BNGH420008150//Human SH3 domain-containing proline-rich kinase (sprk) mRNA, complete cds7/3.40E-139// 

326aa//82%//U07747 

BNGH420014060 

20 BNGH420015760//Mus musculus mRNA for JNK-binding protein JNKBP1 , complete cds.//1.60E-130//381aa//60%// 
AB029482 
BNGH420021680 

BNGH420023870//RIBONUCLEASE INHIBIT0R.//4.70E-41//314aa//34%//P10775 
BNGH420024870 

25 BNGH420035290//MYOSIN HEAVY CHAIN KINASE B (EC 2.7.1 .129) (MHCK B).//7.60E-37//273aa//35%//P90648 
BNGH420036410 
BNGH420040760 
BNGH420042910 
BNGH420045380 

30 BNGH420046790//immunoglobulin lambda light chain variable region [Homo Sapiens].//5.00E-47//84aa//100%// 
AAG24674 
BNGH420052350 

BNGH420059680//DIPZ PROTEIN.//3.00E-13//1 66aa//31%//Q 10801 
BNGH420061350 
35 BNGH420062340 

BNGH420070370//ZINC FINGER PROTEIN GLI1 (GLI)7/3.20E-65//165aa//59%//P47806 

BNGH420074600//DNA-DIRECTED RNA POLYMERASE III 128 KDA POLYPEPTIDE (EC 2.7.7.6) (RNA POLYMER- 
ASE III SUBUNIT 2)7/5 .40E-214//522aa//72%//P251 67 
BNGH420075940 

40 BNGH420077980//Rattus norvegicus ankyrin binding cell adhesion molecule neurofascin mRNA, alternatively spliced 
form, partial cds.//0//588aa//98%//U81 036 
BNGH420085100 

BNGH420086030//N-CHIMAERIN (NC) (N-CHIMERIN) (ALPHA CHIMERIN) (A-CHIMAERIN).//2.30E-21//130aa// 
40%//P15882 

45 BNGH420087430//Mus musculus mRNA 1 for phtf protein7/5.80E-118//237aa//57%//AJ1 33721 

BRACE1 000051 0//KERATIN, ULTRA HIGH-SULFUR MATRIX PROTEIN (UHS KERATIN)7/4.70E-12//132aa//37%// 
P26371 

BRACE20003310//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//1 .00E-205//504aa//71%// 
Q03923 

50 BRACE20007330//RING CANAL PROTEIN (KELCH PROTEIN)7/4.60E-66//562aa//31%//Q04652 
BRACE20009050 

BRACE20014450//Mus musculus mRNA for Ndr1 related protein Ndr2, complete cds.//3.20E-150//291aa//95%// 
AB033921 
BRACE20017790 
55 BRACE20018810 
BRACE20025820 
BRACE20038920 

BRACE20050870//PRE-M RNA SPLICING HELICASE BRR2 (EC 3.6.1 .-)7/8.40E-59//584aa//31%//P32639 
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BRACE20051600 

BRACE20051930//NEUROPILIN PRECURSOR (A5 PROTEIN) (A5 ANTIGEN)7/5.50E-20//179aa//30%//P28824 
BRACE20052430//Homo sapiens AMSH mRNA, complete cds.//4.30E-75//272aa//53%//U73522 
BRACE20052530 
5 BRACE20054080 
BRACE20054480 

BRACE20054600//Xenopus laevis mRNA for Kielin, complete cds.//4.30E-70//205aa//60%//AB026192 
BRACE20055560 
BRACE20057870 
10 BRACE20059110 
BRACE20059810 

BRACE20061 620//ZINC-BINDING PROTEIN A33.//3.60E-30//329aa//28%//Q02084 
BRACE20062580 

BRACE20063540//MEROZOITE SURFACE PROTEIN CMZ-8 ( FRAGMENT). //1 .60E-10//164aa//35%//P09125 
15 BRACE20065470//Xenopus laevis ubiquitin-like fusion protein mRNA, complete cds.//6.70E-63//170aa//71%//L08474 
BRACE20066360 
BRACE20068710 

BRACE20069000//CLN3 PROTEIN (BATTENIN) (BATTEN'S DISEASE PROTEIN).//1 .20E-147//279aa//100%// 
Q13286 
20 BRACE20069110 
BRACE20069440 

BRACE20079200//Xenopus laevis mRNA for Kielin, complete cds.//3.10E-15//63aa//58%//AB026192 
BRACE20079370//microtubule associated-protein orbit [Drosophila melanogaster].//8.00E-42//282aa//36%// 
BAA94248 

25 BRACE20097540//Homo sapiens protein serine/threonine phosphatase 4 regulatory subunit 1 (PP4R1 ) mRNA, com- 
plete cds.//2.80E-96//1 93aa//96%//AF1 1 1 1 06 
BRACE20098860 
BRACE20099070 

BRACE201 94670//UDP-GALACTOSE TRANSLOCATOR (UDP-GALACTOSE TRANSPORTER) (UGT) (UDP-GAL- 
30 TR)7/1 .40E-32//72aa//98%//P78381 

BRACE20196180//Homo sapiens HMG domain protein HMGX2 (HMGX2) mRNA, complete cds.//6.90E-154//235aa// 
91 %//AF1 46223 
BRACE201 96960 

BRACE20200770//PROTEIN MOV-1 0.//3.30E-24//1 1 3aa//50%//P23249 
35 BRACE20200970 

BRACE20204670//PROTEIN-TYROSINE PHOSPHATASE ALPHA PRECURSOR (EC 3.1.3.48) (R-PTP- ALPHA).// 
4.30E-237//428aa//99%//P1 8433 
BRACE20205840 
BRACE20207420 
40 BRACE20212450 

BRACE20215410//PROTEIN-TYROSINE PHOSPHATASE YVH1 (EC 3.1 .3.48) (PTPASE 

YVH 1 )y/2.90E-08//1 36aa//31 %//Q02256 

BRACE20216700 

BRACE2021 6950//4F2 CELL-SURFACE ANTIGEN HEAVY CHAIN (4F2HC) (LYMPHOCYTE ACTIVATION ANTIGEN 
45 4F2 LARGE SUBUNIT) (4F2 HEAVY CHAIN ANTIGEN) (CD98 ANTIGEN).//4.80E-66//94aa//90%//P08195 

BRACE20219360 

BRAMY 10000980 

BRAMY1 0001 730 

BRAMY20000210 
50 BRAMY20000250 

BRAMY20001510//Homo sapiens RING zinc finger protein (RZF) mRNA, complete cds.//3.80E-131//245aa//99%// 
AF037204 

BRAMY20003540//1-PHOSPHATIDYLINOSITOL-4,5-BISPHOSPHATE PHOSPHODIESTERASE DELTA 1 (EC 
3.1.4.11) (PLC-DELTA-1) (PHOSPHOLIPASE C-DELTA-1) (PLC-lll)y/0//745aa//99%//P51178 
55 BRAMY20003880 

BRAMY20005080//UBIQUITIN C ARBOXYL-TE RM I NAL HYDROLASE 16 (EC 3.1.2.15) (UBIQUITIN THIOLESTE- 
RASE 16) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 16) (DEUBIQUITINATING ENZYME 16) (UBIQUITIN 
PROCESSING PROTEASE UBP-M).//2.70E-46//93aa//100%//Q9Y5T5 
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BRAMY20013670//PECANEX PROTEIN7/1 .80E-84//300aa//56%//P 18490 
BRAMY20016780 
BRAMY20020440 
BRAMY20021580 
5 BRAMY20023390 
BRAMY20023640 
BRAMY20024790 
BRAMY20027390 

BRAMY20027990//Homo sapiens NEDL1 mRNAfor NEDD4-like ubiquitin ligase 1 , complete cds.//4.60E-158//294aa// 
10 100%//AB048365 
BRAMY20028530 

BRAMY20028620//NICOTINATE-NUCLEOTIDE PYROPHOSPHORYLASE [CARBOXYLATING] (EC 2.4.2.19) (QUI- 
NOLINATE PHOSPHORIBOSYLTRANSFERASE [DECARBOXYLATING]) (QAPRTASE).//9.70E-18//53aa//84%// 
Q 15274 

15 BRAMY20035380//ZINC TRANSPORTER 1 (ZNT-1)7/5.60E-40//211aa//35%//Q62720 

BRAMY20035830//Homo sapiens RCd-like G exchanging factor RLG mRNA, complete cds.//3.60E-103//251aa// 

75%//AF060219 

BRAMY20036530 

BRAMY20036810 

20 B RAM Y20038980//i NTR AC ELLULAR PROTEIN TRANSPORT PROTEIN US01 Y/3.40E-1 7//407aa//23%//P25386 
BRAMY20039290 

BRAMY20040580//ZINC FINGER PROTEIN 135.//1 .70E-29//115aa//54%//P52742 
BRAMY20043520 

BRAMY2004363Q//Homo sapiens Ras-binding protein SUR-8 mRNA, complete cds.//1 .30E-167//364aa//88%// 
25 AF068920 

BRAMY20044920//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 4 (EC 3.1.2.15) (USIQUITIN THIOLESTE- 
RASE 4) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 4) (DEUBIQUITINATING ENZYME 4) (UBIQUITOUS 
NUCLEAR PROTEIN HOMOLOG.//7.60E-28//86aa//4796//Q13107 
BRAMY20045210 
30 BRAMY20045420 
BRAMY20047560 
BRAMY20050640 
BRAMY20050940 

BRAMY20051820//Human mRNA for Doc2 (Double C2), complete cds.//1 .90E-49//102aa//99%//D31897 
35 BRAMY20052440 
BRAMY20053910 

BRAMY20055760//POTENTIAL PHOSPHOLIPID-TRANSPORTING ATPASE VA (EC 3.6.1 .-).//5.80E-130//393aa// 
59%//054827 

BRAMY20056620//Homo sapiens mccb mRNA for non-biotin containing subunit of 3-methylcrotonyl-CoA carboxylase, 
40 complete cds.//3.00E-106//203aa//100%//AB050049 

BRAMY20056840//UBE-1 C2//2.40E-74//261 aa//53%//AB030505 

BRAMY20063750//Homo sapiens HRIHFB2007 mRNA, partial cds.//3.40E-139//253aa//99%//AB0 15330 
BRAMY20072440 

BRAMY20072870//ACETYL -CO ENZYME A SYNTHETASE (EC 6.2.1.1) (ACETATE— COA LIGASE) (ACYL- ACTI- 
45 VATING ENZYME).//2.50E-1 5//88aa//46%//P1 6929 
BRAMY20073080 
BRAMY20074110 
BRAMY20074860 

BRAMY20076100//STEROIDOGENIC FACTOR 1 (STF-1) (SF-1) (STEROID HORMONE RECEPTOR AD4BP) 
50 (FUSHI TARAZU FACTOR HOMOLOG 1).//4.80E-66//132aa//94%//P50569 
BRAMY20076130 
BRAMY20076530 

BRAMY20083330//SYNAPSIN l.//4.50E-05//155aa//29%//P 17599 
BRAMY20083820 

55 BRAMY20089770//P2X PURINOCEPTOR 7 (ATP RECEPTOR) (P2X7) (PURINERGIC RECEPTOR) (P2Z RECEP- 
TOR) .//3.30E- 1 36//242aa//99%//Q99572 

BRAMY20091230//MITOCHONDRIAL UNCOUPLING PROTEIN 4 (UCP 4).//4.60E-121//224aa//100%//095847 
BRAMY20093490//Mus musculus ubiquitin-protein ligase E3-alpha (Ubr1) mRNA, complete cds.//7.80E-43//139aa// 
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55%//AF061555 

BRAMY20094890//A KINASE ANCHOR PROTEIN 4 PRECURSOR (MAJOR FIBROUS SHEATH PROTEIN) (FSC1) 
(P82).//4.60E-06//1 31 aa//27%//Q60662 
BRAMY20095080 
5 BRAMY20095570 

BRAMY20096930//Torpedo marmorata mRNA for male sterility protein 2-like protein (ms21 gene).//2.00E-63//139aa// 

82%//AJ272073 

BRAMY201 00680 

BRAMY20102900//Homo sapiens RU1 (RU1) mRNA, complete cds.//1 .20E-47//151aa//58%//AF168132 
10 BRAMY201 07980 

BRAMY20111780//ZINC FINGER PROTEIN 135.//1 .00E-139//416aa//57%//P52742 

BRAMY201 1 7670//Mus musculus mmDNAJA4 mRNA for mmDj4, complete cds.//3.20E-1 1 8//239aa//90%//AB032401 
BRAMY201 18410 

BRAMY2011 8490//GLYCEROL KINASE 2 (EC 2.7.1.30) (ATP:GLYCEROL 3-PHOSPHOTRANSFERASE 2) (GLYC- 
15 EROKINASE 2) (GK 2)7/1 .80E-48//247aa//40%//Q9X1 E4 
BRAMY20120170 
BRAMY201 23400 
BRAMY201 24970 
BRAMY20125170 

20 B RAM Y201 25360//L-AS PAR AG I N AS E (EC 3.5.1.1) (L-ASPARAGINE AMIDOHYDROLASE).//3.30E-53//148aa// 
43%//Q9ZSD6 

BRAMY20125550//Homo sapiens mRNA for 28kD interferon responsive protein (IFRG28 gene).//4.40E-16//155aa// 
33%//AJ251832 
BRAMY201 26910 
25 BRAMY201 27310 
BRAMY201 27760 

BRAMY20134050//NUCLEOSOME ASSEMBLY PROTEIN MIKE 2 (BRAIN-SPECIFIC PROTEIN, X- LINKED).// 
1 .40E-25//1 09aa//569d//P51 860 
BRAMY201 35720 

30 BRAMY20137360//Homo sapiens gene for TU12B1-TY, exon 12 and complete cds.//2.10E-18//257aa//31%// 
AB032786 
BRAMY201 39440 
BRAMY201 39750 

BRAMY20143870//PEPTIDYL-TRNA HYDROLASE (EC 3.1.1.29) (PTH).//5.80E-27//182aa//35%//P96386 
35 B R AM Y20 1 525 1 0//P ROTE I N -TV RO S I N E PHOSPHATASE STRIATUM-ENRICHED (EC 3.1.3.48) (STEP) (NEU- 
RAL-SPECIFIC PROTEIN-TYROSINE PHOSPHATASE) (FRAGMENT).//4.80E-293//537aa//97%//P54829 
BRAMY201 55500 

BRAMY20158550//CALMODULIN.//1 .60E-15//11 6aa//42%//P04352 
BRAMY201 59250 
40 BRAMY201 60020 
BRAMY201 73480 

BRAMY20190550//EPIDERMAL GROWTH FACTOR RECEPTOR SUBSTRATE SUBSTRATE 15 (PROTEIN EPS15) 
(AF-1 P PROTEIN).//2.20E-226//464aa//89%//P42566 
BRAMY201 94680 
45 BRAMY20204270 

BRAMY20206340//GUANINE NUCLEOTIDE-BINDING PROTEIN BETA SUBUNIT-LIKE PROTEIN.//4.20E-07// 

151aa//27%//P38011 

BRAMY20219620 

BRAMY20221600//H.sapiens mRNA for novel T-cel activation protein.//1 .60E-130//245aa//99%//X94232 
50 BRAMY20223010//Mus musculus leucine-rich glioma-inactivated 1 protein precursor, (Lgi1) mRNA, complete cds.// 
2.00E-79//269aa//52%//AF24681 8 
BRAMY20225250 
BRAMY20225320 
BRAMY20227230 

55 BRAMY20227860//Homo sapiens dickkopf-3 (DKK-3) mRNA, complete cds.//2.30E-69//147aa//92%//AF1 77396 
BRAMY20227960 

BRAMY20231150//PUTATIVE ACID PHOSPHATASE F26C11.1 (EC 3.1 .3. 2)7/2.30 E-55//322aa//39%//Q09549 
BRAMY20234820//Homo sapiens mitotic checkpoint protein (MAD1) mRNA, complete cds.//1.30E-286//561aa// 
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100%//AF123318 
BRAMY20237190 

BRAMY20238630//TETRATRICOPEPTIDE REPEAT PROTEIN 4.//1.20E-147//276aa//99%//095801 
BRAMY20243120 

5 BRAMY20244490//ADENYLATE KINASE ISOENZYME 1 (EC 2.7.4.3) (ATP-AMP TRANS PHOSPHORYLASE) (AK1) 
(MYOKINASE)7/2.50E-1 9//1 1 9aa//37%//P00571 

BRAMY20245140//Rattus norveglcus potassium channel (erg2) mRNA, complete cds.//1 .00E-178//427aa//81%// 

AF016192 

BRAMY20245350 

10 BRAMY20245760//Araneus dladematus fibroin-4 mRNA, partial cds.//7.90E-05//285aa//22%//U47856 

BRAMY20251210//EPHRIN TYPE-A RECEPTOR 7 PRECURSOR (EC 2. 7. 1. 112) (TYROSINE-PROTEIN KINASE 
RECEPTOR EHK-3) (EPH HOMOLOGY KINASE-3) (EMBRYONIC BRAIN KINASE) (EBK) (DEVELOPMENTAL KI- 
NASE 1) (MDK-1).//3.80E-94//268aa//66%//Q61772 

BRAMY20251750//Homo sapiens BRI3 mRNA, complete cds.//2.80E-131//242aa//95%//AF272043 
15 BRAMY20263000//DYSTROPHIA MYOTON IC A-CONTAI N ING WD REPEAT MOTIF PROTEIN (DMR-N9 PRO- 
TEIN ).//2 . 60 E- 1 34//430a a//59%//Q08274 
BRAMY20267780 
BRAMY20269040 
BRAMY20271140 

20 BRAMY20274510//60S RIBOSOMAL PROTEIN L127/1 .10E-39//102aa//82%//P30050 
BRAMY20285650 
BRAMY20287400 

BRAWH2001 4590//ZAKI-4 PROTEIN.//3.1 0E-92//1 87aa//93%//Q1 4206 
BRAWH20020470 
25 BRAWH20020600 

BRAWH20021910//FATTY ACYL-COA HYDROLASE PRECURSOR, MEDIUM CHAIN (EC 3.1.2.14) (THIO ESTE- 
RASE B)7/1 .30E-111//450aa//49%//Q04791 
BRAWH20025490 

BRAWH20026010//AD021 protein [Homo sapiens]//4.00E-55//245aa//44%//NP_057697 
30 BRAWH20027250 

BRAWH20030000 , 

BRAWH20039640//SLIT PROTEIN PRECURSOR.//6.10E-19//282aa//31%//P24014 

BRAWH20040680//PUTATIVE TRANSCRIPTION ELONGATION FACTOR S-ll (TFIIS).//5.90E-06//179aa//29%// 
P52652 
35 BRAWH20047790 

BRAWH20050740//ZINC FINGER PROTEIN 151 (POLYOMAVIRUS LATE INITIATOR PROMOTER BINDING PRO- 
TEIN) (LP-1) (ZINC FINGER PROTEIN Z13)7/1.60E-16//235aa//30%//Q60821 
BRAWH20055240 
BRAWH20055330 
40 BRAWH20055780 
BRAWH20058120 

BRAWH20063010//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53).//2.60E-06//121 aa//33%// 
Q 15427 

BRAWH20078080 
45 BRAWH20078620 

BRAWH20080580//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2)7/1 .00E-116//316aa//63%//P51523 
BRAWH20082550 

BRAWH20082920//Human TFIIIC Box B-binding subunit mRNA, complete cds.//1 .90E-36//72aa//100%//U02619 
BRAWH20093040//PROTEIN KINASE CLK2 (EC 2.7.1 .-).//2.70E-86//162aa//96%//P49760 
50 BRAWH20093070//SYNAPSIN.//4.80E-06//245aa//28%//Q24546 

BRAWH20094900//Mus musculus mRNA for sialidase, complete cds.//5.70E-73//310aa//50%//AB026842 
BRAWH20095900//ZINC FINGER PROTEIN 184 (FRAGMENT) 7/1 .20E-1 70//631aa//48%//Q 99676 
BRAWH20 173790 

BRAWH20174330//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53).//4.50E-06//121aa//33%// 
55 Q 15427 

BRAWH20 175230 
BRAWH201 75340 

BRAWH201 76850//Mus musculus mRNA for nuclear protein ZAP, complete cds.//9.50E-1 51//61 9aa//53%//AB0331 68 
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BRAWH20 182670 

BRAWH20183170//GRR1 PROTEIN.//9.30E-13//218aa//28%//P24814 
BRAWH201 85260 
BRAWH201 85270 
5 BRAWH201 86010 

BRAWH20188750//BIOTIN SYNTHESIS PROTEIN BIOC.//5.80E-11//190aa//27%//P36571 

BRAWH20190530//Homo sapiens BNPI mRNA for brain -specific Na-dependent inorganic phosphate cotransporter, 
complete cds7/2.1 0E-1 09//1 1 8aa//1 00%//AB032436 

BRAWH20 1 90550//PUTATI VE SERINE/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1 .30E-05//172aa// 
10 29%//P49695 

BRAWH20191980//PROLINE OXIDASE, MITOCHONDRIAL PRECURSOR (EC 1.5.3.-) (PROLINE DEHYDROGE- 
NASE) V/2. 1 0E-1 25//234aa//99%//043272 

BRCAN10000760//UREA TRANSPORTER, ERYTHROCYTE.//1 .30E-212//389aa//100%//Q 13336 
BRCAN10001050//PEANUT-LIKE PROTEIN 2 (BRAIN PROTEIN H5).//1 .40E-62//122aa//98%//043236 
15 BRCAN1 0001 680 
BRCAN20001480 

BRCAN200041 80//alpha-1 C-adrenergic receptor splice form 2 - human//1 .1 0E-22//76aa//76%//S65657 
BRCAN20005230//HEPARIN SULFATE N-DEACETYLASE/N-SULFOTRANSFERASE (EC 2.8.2.-) (N- HSST) 
(N-HEPARIN SULFATE SULFOTRANSFERASE) (GLUCOSAMINYL N-DEACETYLASE/N-SULFOTRANS- 
20 FERASE).//8.90E-1 5//1 68aa//28%//P52849 

BRCAN20005410//Human 1(3)mbt protein homolog mRNA, complete cds.//2.00E-95//378aa//50%//U89358 
BRCOC1 0000400 

BRCOC20000470//Homo sapiens DEME-6 mRNA, partial cds.//7.30E-37//252aa//28%//AF007170 
BRCOC20003600/A/ACUOLAR ATP SYNTHASE SUBUNIT AC45 PRECURSOR (EC 3.6.1.34) (V-ATPASE AC45 
25 SUBUNIT).//5.90E-192//418aa//85%//P40682 
BRHIP1 0000720 

BRHIP10001040//tweety homolog 1 (Drosophila) [Mus musculus]//1 .30E-68//311aa//44%//NP_067299 
BRHIP20000210 
BRHIP20003590 
30 BRHIP20005060 
BRSSN20001970 

BRSSN2000561 0//Mus musculus semaphorin cytoplasmic domain -associated protein 3A (Semcap3) mRNA, complete 

cds.//6.30E-225//730aa//60%//AF1 27084 

BRSSN20005660 

35 BRSSN20066440//ZINC FINGER PROTEIN 202.//3.60E-37//169aa//37%//095125 

BRSSN20074640//HYPOTHETICAL 35.8 KDA PROTEIN IN PRP16-SRP40 INTERGENIC REGION.//4.50E-20// 

217aa//28%//P36163 

BRSSN20091190 

BRSSN20092440 

40 BRSSN20093890//Homo sapiens mRNA for Kelch motif containing protein, complete cds.//8.40E-13//203aa//30%// 
AB026190 

CD34C20001750//MHC CLASS I NK CELL RECEPTOR PRECURSOR (NATURAL KILLER ASSOCIATED TRAN- 
SCRIPT 4) (NEAT-4).//1 .1 0E-18//214aa//3596//P43630 
CTONG1 0000090 
45 CTONG20000340 
CTONG20002790 

CTONG20004110//Mus musculus ankycorbin mRNA, complete cds.//6.20E-55//1 006aa//24%//AF202315 
CTONG20004520/ldevelopment- and differentiation-enhancing factor 2; PYK2 C terminus-associated protein [Homo 
Sapiens].//2.00E-86//31 0aa//81 %//N P_003878 
50 CTONG20007660//Rattus norvegicus caspase recruitment domain protein 9 mRNA, complete cds.//7.30E-28//31 9aa// 
32%//AF311288 

CTONG20008190//YPT1 -RELATED PROTEIN 2.//3.00E-30//160aa//40%//P17609 

CTONG20008460 

CTONG20015240 

55 CTONG20017490//SEMAPHORIN 4A PRECURSOR (SEMAPHORIN B) (SEMA B).//3.10E-273//607aa//82%// 
Q62178 

CTONG20020660 

CTONG20020950//ZINC FINGER PROTEIN 37 (ZFP-37) (MALE GERM CELL SPECIFIC ZINC FINGER PROTEIN).// 
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7.40E-23//258aa//25%//P1 71 41 
CTONG20027660 

CTONG20029030//Homo sapiens Ras-binding protein SUR-8 mRNA, complete cds.//8.30E-25//402aa//28%// 
AF068920 

5 CTONG20030280//VEGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1 J/2. 90 E-1 2//303aa//25%//Q 00808 
CTONG20031150 
CTONG20031890 

CTONG20032930//microtubule associated-protein orbit [Drosophila melanogaster]//1 .00E-79//913aa//30%// 
BAA94248 
10 CTONG20033500 

CTONG20033610//Rattus norvegicus SNIP-a mRNA, complete cdsV/2.50 E-1 45//567aa//41%//AF 156981 
CTONG20033750//Drosophila melanogaster AAA family protein Bor (bor) mRNA, complete cds.//1 .40E-174//492aa// 
66%//AF227209 
CTONG20035240 
15 CTONG20036800 

CTON G20036990//BASE M ENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE PROTEIN 

PRECURSOR (HSPG) (PERLECAN) (PLC).//1 .10E-10//247aa//27%//Q05793 

CTONG20039370 

CTONG20041 1 50//Streptomyces ansochromogenes strain 71 00 SanE (sanE) gene, complete cds.//5.20E-05//1 33aa// 
20 35%//AF228524 

CTONG20041260//Mus musculus retinoic acid-responsive protein (Stra6) mRNA, complete cds.//1 .30E-238//602aa// 
74%//AF062476 

CTON G20042640//N EURO BLAST DIFFERENTIATION ASSOCIATED PROTEIN AHNAK (DESMOYOKIN) (FRAG- 
MENTS). //0//797aa//7396//Q 09666 
25 CTONG20044230//Mus musculus zinc finger protein (MtsM ) mRNA, partial cds.//1 .40E-289//601 aa//89%//AF1 91 309 
CTONG20044870 

CTONG20046500//ANION EXCHANGE PROTEIN 3 (CARDIAC/BRAIN BAND 3-LIKE PROTEIN) (CAE3/BAE3).// 
2.00E-1 9//276aa//30%//P48751 
CTONG20046690 
30 CTONG20049480 
CTONG20050490 

CTONG20051100//PUTATIVE METHYLTRANSFERASE (EC 2. 1 . 1 . -).//3.50E-29//72aa//87%//043709 
CTONG20051450//testis development protein PRTD [Homo sapiens].//9.00E-50//140aa//85%//AAG33852 
CTONG20052780//Homo sapiens mRNA for SH3 binding protein, complete cds.//B.O0E-21//125aa//42%//AB005047 
35 CTONG20053990//ZINC FINGER PROTEIN 1 95.//4.30E-08//40aa//75%//014628 
CTONG20055670 

CTONG20055850//Rattus norvegicus golgi peripheral membrane protein p65 (GRASP65) mRNA, complete cds.// 
1 . 1 0E-99//248aa//78%//AF01 5264 
CTONG20056150 
40 CTONG20057750 
CTONG20057950 

CTONG20059130//Mus musculus prominin-like protein mRNA, partial cds.//7.50E-103//259aa//7796//AF128113 
CTONG20060040 
CTONG20061290 
45 CTONG20062730 

CTONG20063770//M-PHASE PHOSPHOPROTEIN 9 (FRAGMENT).//1.80E-96//184aa//100%//Q99550 
CTONG20063930//BETA-CHIMAERIN (BETA-CHIMERIN).//3.50E-31//189aa//34%//Q03070 
CTONG20065240 
CTONG20065680 

50 CTONG200661 1 0//Homo sapiens DEME-6 mRNA, partial cds.//8.70E-1 64//557aa//53%//AF0071 70 

CTONG20068360//MITOCHONDRIAL CARNITINE/ACYLCARNITINE CARRIER PROTEIN (CARNITINE/ACYLCAR- 

NITINE TRANSLOCASE) (CAC).//8.00E-30//248aa//35%//P97521 

CTONG20069320 

CTONG20069420 
55 CTONG20070090 

CTONG20070720//N-CHIMAERIN (NC) (N-CHIMERIN) (ALPHA CHIMERIN) (A-CHIMAERIN).//9.20E-25//180aa// 
33%//P30337 

CTONG20070780//SPERM-SPECIFIC ANTIGEN 2 (CLEAVAGE SIGNAL-1 PROTEIN) (CS- 1)7/1. 90 E-122//249aa// 
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97%//P28290 

CTONG2047091 0//Homo sapiens mRNA for 26S proteasome subunit p55, complete cds.//7.70E-227//400aa//1 00%// 
AB003103 

CTONG20071040//BETA CRYSTALLIN B2 <BP).//6.80E-25//195aa//34%//P26775 
5 CTONG20071 680//H YPOTH ETICAL 33.6 KDA PROTEIN IN TDK-PRFA INTERGENIC REGION 7/7.40E-1 4//328aa// 
23%//P45869 

CTONG20072930//ZINC FINGER PROTEIN 41 (FRAGMENT)V/4.10E-216//542aa7/69%//P51814 
CTONG20073990 

CTONG20074000//Mus musculus teashirt 2 (Tsh2) gene, partial cds.//0//1024aa//89%//AF207880 
10 CTONG20074170 
CTONG20074740 
CTONG20076230 

CTONG20076810//site-1 protease of sterol regulatory element binding proteins [Cricetulus griseus]//2.80E-245// 
463aa//93%//AF078l05 
15 CTONG20077760//SYNAPSIN I.//1 .70E-08//209aa//32%//P1 7599 

CTONG20078340//SUP PRESSOR PROTEIN SRP40 7/4.1 0E-08//282aa//26%//P32583 

CTONG20079590//ALPHA-N-ACETYLGALACTOSAMINIDE ALPHA-2.6-SIALYLTRANSFERASE (EC 2.4.99.-) 
(ST6GALNACIII) (STY).//1 .80E-151//305aa//85%//Q64686 

CTONG20080140//HYPOTHETICAL60.3 KDA PROTEIN R13G10.2 IN CHROMOSOME lll.//8.40E-29//179aa//40%// 
20 Q21988 

CTONG20081840 
CTONG20083430 

CTONG20083980/A/ASODILATOR-STIMULATED PHOSPHOPROTEIN (VASP).//4.50E-10//113aa//34%//P50552 
CTONG20084020 

25 CTONG20084660//ZINC FINGER PROTEIN 165.//3.30E-33//142aa//57%//P49910 

CTONG20085210//MONO- AND DIACYLGLYCEROL LIPASE PRECURSOR (EC 3.1.1.-) (MDGL).//3.60E-06//94aa// 
34%//P25234 
CTONG201 33720 
CTONG201 65590 

30 CTONG20165750//SON PROTEIN (SON3).//4.40E-239//427aa//99%//P 18583 
CTONG201 66580 
CTONG201 67750 
CTONG201 68240 
CTONG201 68460 

35 CTONG20169040//KERATIN, TYPE I CYTOSKELETAL 15 (CYTOKERATIN 15) (K15) (CK 15).//6.00E-112//223aa// 
99%//P19012 
CTONG201 69530 

CTONG20170940//MYOTROPHIN (V-1 PROTEIN) (GRANULE CELL DIFFERENTIATION PROTEIN).//4.60E-10// 
93aa//36%//P80144 
40 CTONG20174290//TRICHOHYALIN.//1.30E-07//340aa//21%//P37709 
CTONG201 74440 

CTONG20174580//Homo sapiens mRNA for vascular Rab-GAP/TBC-containing protein complete cds7/1 .OOE-115// 
335aa//61 %//NP_008994 

CTONG20176040//ADP-RIBOSYLATION FACTOR-LIKE PROTEIN 3 (ARD3).//1 .70E-34//155aa//43%//P37996 
45 CTONG201 79390 

CTONG201 79890 

CTONG201 79980 

CTONG201 80620 

CTONG201 80690 
50 CTONG20181350 

CTONG20183430//ANKYRIN 2 (BRAIN ANKYRIN) (ANKYRIN B) (ANKYRIN, NONERYTHROID).//4.60E-30//311aa// 
32%//Q01484 

CTONG20183830//IRLB [Homo sapiensJ//1.50E-104//191aa//100%//CAA45013 
CTONG20184130 

55 CTONG20184830//ATP-BINDING CASSETTE, SUB-FAMILY A, MEMBER 1 (ATP-BINDING CASSETTE TRANS- 
PORTER 1) (ATP-BINDING CASSETTE 1).//1.30E-63//271aa//47%//P41233 
CTONG20186140 

CTONG201 86290//ALDEHYDE DEHYDROGENASE, DIMERIC NADP-PREFERRING (EC 1.2.1.5) (ALDH CLASS 
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3).//1 .50E-74//1 44aa//1 00%//P30838 

CTONG20186370//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.60E-52//324aa//33%//P51523 
CTONG20186520//ZINC FINGER PROTEIN MFG-37/1 .40E-197//643aa//53%//P1 6374 
CTONG20186550//cca3 protein - rat//2.10E-37//141aa//56%//T31081 
5 CTONG201 88080 

CTONG20189000//PROBABLE GUANINE NUCLEOTIDE REGULATORY PROTEIN TIM (ONCOGENE TIM) (P60 
TIM) (TRANSFORMING IMMORTALIZED MAMMARY ONCOGEN E).//1.10E-48//222aa//50%//Q 12774 
CTONG20190290//39.1 KDA PROTEIN IN SURE-CYSC INTERGENIC REGION7/6.00E-15//132aa//31%//Q57261 
CTONG201 90630 

10 DFNES20016470//Homo sapiens SDP1 protein mRNA, complete cds.//4.90E-33//95aa//37%//AF076957 

DFNES20018000//CADHERIN-RELATED TUMOR SUPPRESSOR PRECURSOR (FAT PROTEIN) 7/8. 70 E-23// 
306aa//30%//P33450 

DFNES20025500//Homo sapiens mRNA for paraplegin-like protein.//3.00E-29//68aa//94%//Y18314 
DFNES200281 70//Mouse mRNA for RNA polymerase I associated factor (PAF53), complete cds.//1 .40E-1 65//393aa// 
15 78%//D 14336 

DFNES20029660 
DFNES20032550 
DFNES20043710 

DFNES20046840//FORKHEAD BOX PROTEIN E1 (FORKHEAD-RELATED PROTEIN FKHL15) (THYROID TRAN- 
20 SCRIPTION FACTOR 2) (TTF-2).//6.20E-05//1 51 aa//32%//000358 

DFNES20055400//Homo sapiens diphthamide biosynthesis protein-2 (DPH2) mRNA, complete cds7/5.40E-203// 
41 3aa//91 %//AF053003 

DFNES20057660//GRAVE'S DISEASE CARRIER PROTEIN (GDC) (MITOCHONDRIAL SOLUTE CARRIER PRO- 
TEIN HOMOLOG).//7.00E-31//247aa//33%//Q01888 
25 D FN ES20063460//PAB-D EP EN DENT POLY(A)-SPECIFIC RIBONUCLEASE SUBUNIT PAN3 (EC 3.1.13.4) 
(PAB1 P-DEPENDENT POLY(A)-NUCLEASE).//1 .90E-23//1 1 5aa//43%//P361 02 

DFNES20072990//HYPOTHETICAL46.7 KDA PROTEIN IN HOR7-COX7 INTERGENIC REGION.//1 .80E-22//310aa// 
25%//Q04835 

DFNES20073320//Mus musculus RING-finger protein MURF mRNA, complete cdsV/6.70E-118//362aa//61%// 
30 AF294790 

DFNES20076340 

D FN ES20080880//POLY PEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (EC 2.4.1.41) (PROTEIN- UDP 
ACETYLGALACTOSAMINYLTRANSFERASE) (UDP-GALNAC:POLYPEPTIDE, N- ACETYLGALACTOSAM I N YL- 
TRANSFERASE) (GALNAC-T1).//2.00E-104//486aa//43%//Q10472 
35 DFNES20088810 

DFNES20094820//coronin-like protein [Schizosaccharomyces pombe]//3.60E-20//333aa//24%//CAB11184 
FCBBF10000230//H. sapiens mRNA from TYL gene.//3.30E-155//650aa//51%//X99688 
FCBBF1 0002200 

FCBBF10004760//Homo sapiens GAP-like protein (N61) mRNA, complete cds.//9.10E-82//412aa//44%//AF251038 
40 FCBBF20018680//RABPHILIN-3A.//1.70E-16//262aa%/30%//P47709 
FCBBF20020440 
FCBBF20021110 

FCBBF20023490//PUTATIVE PRE-MRNA SPLICING FACTOR ATP-DEPENDENT RNA HELICASE 
SPAC10F6.02C.//1 .20E-90//505aa//39%//042643 
45 FCBBF20028980 
FCBBF20029280 
FCBBF20032930 

FCBBF20033360//RING CANAL PROTEIN (KELCH PROTEIN).//7.40E-33//234aa//32%//Q04652 

FCBBF20035430//Mus musculus arsenite inducible RNA associated protein (Airap) mRNA, complete cds.//1 .50E-51// 
50 152aa//57%//AF224494 

FCBBF20035490//GAP-associated tyrosine phosphoprotein p62 (Sam68) [Homo sapiens] >pir1IA38219 GAP-associ- 

ated tyrosine phosphoprotein p62//1 .50E-214//415aa//93%//NP_006550 

FCBBF20036360 

FCBBF20038230 
55 FCBBF20038950 

FCBBF20041380 

FCBBF20043730 

FCBBF20054390 
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FCBBF20056580//Mus musculus NSD1 protein mRNA, complete cds7/3.40E-304//773aa//75%//AF064553 

FCBBF20059660 

FCBBF20061310 

FCBBF20066340//Homo sapiens nuclear dual-specificity phosphatase (SBF1) mRNA, partial cds.//1 .20E-68//312aa// 
5 49%//U93181 

FCBBF20070800 

FCBBF20070950//M ICRONUCLEAR LINKER HISTONE POLYPROTEIN (MIC LH) [CONTAINS: LINKER HISTONE 
PROTEINS ALPHA, BETA, DELTA AND GAMMA].//5.00E-10//601aa//20%//P40631 
FCBBF30000010 
10 FCBBF30001020 

FCBBF30001100//CRAG protein [Drosophila melanogaster]//7.40E-185//800aa//46%//CAA76938 
FCBBF30001150 

FCBBF30002270//HISTONE HV (H1.0) (H1(0)).//4.90E-62//154aa//84%//P07305 

FCBBF30002280//THIOREDOXIN PEROXIDASE 2 (THIOREDOXIN-DEPENDENT PEROXIDE REDUCTASE 2) 
15 (PROLIFERATION-ASSOCIATED PROTEIN PAG) (NATURAL KILLER CELL ENHANCING FACTOR A) (NKEF-A).// 
1 .20E-27//61 aa//98%//Q06830 
FCBBF30002330 

FCBBF30003810//2INC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//9.10E-93//313aa//53%// 
Q05481 

20 FCBBF30004340//Homo sapiens GalNAc-T9 mRNA for UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase, 
complete cdsV/1 .60E-109//299aa//63%//AB040672 
FCBBF30004730 
FCBBF30005180 

FCBBF30005360//Mus musculus spermatogenesis associated factor (SPAF) mRNA, complete cds.//0//894aa//84%// 
25 AF049099 

FCBBF30005500//HYPOTHETICAL PROTEIN KIAA01 67.//5.80E-1 6//1 24aa//36%//Q99490 

FCBBF30019140//CHROMODOMAIN HELICASE-DNA-BINDING PROTEIN 3 (CHD-3) (MI-2 AUTOANTIGEN 240 
KDA PROTEIN) (MI2-ALPHA).//0//725aa//82%//Q12873 

FCBBF30019180//SERINE/THREONINE PROTEIN PHOSPHATASE 2A, 65 KDA REGULATORY SUBUNIT A, AL- 
SO PHA ISOFORM (PP2A, SUBUNIT A, PR65-ALPHA ISOFORM) (PP2A, SUBUNIT A, R1 -ALPHA ISO FORM). //4. 60 E- 
233//451 aa//98%//P5461 2 
FCBBF30019240 

FCBBF30021900//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//4.1 0E-1 61//633aa//48%// 
Q05481 

35 FCBBF30022680//putative 5'-3* exonuclease//9.00E-1 2//200aa//25%//AAG29662 

FCBBF30026580//Homo sapiens retinoblastoma-associated protein RAP 140 mRNA, complete cds.//7.60E-27// 
367aa//28%//AF1 80425 

FCBBF30029250//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//4.00E-18//754aa//23%//P08640 
40 FCBBF30035570 

FCBBF30042610//Homo sapiens CTL2 gene.//2.10E-137//393aa//60%//AJ245621 
FCBBF30048420//TBX19 PROTEIN (T-BOX PROTEIN 19).//1.40E-94//212aa//85%//060806 
FCBSF30053300//Human autoantigen pericentriol material 1 (PCM-1) mRNA, complete cds.//0//708aa//90%//L27841 
FCBBF30056980 

45 FCBBF30062490//MUS musculus prominin-like protein mRNA, partial cds.//7.70E-85//210aa//79%//AF128113 
FCBBF30063990 
FCBBF30068210 

FCBBF30071 500//Homo sapiens dentin phosphoryn mRNA, complete cds.//2.80E-09//675aa//22%//AF094508 
FCBBF30072440//Homo sapiens SARDH mRNA, alternatively spliced, complete cds.//1 .70E-14//81aa//53%// 
50 AF095737 

FCBBF30072480 
FCBBF30074530 
FCBBF30074620 
FCBBF30075970 

55 FCBBF30076310//CAMP-DEPENDENT PROTEIN KINASE, BETA-CATALYTIC SUBUNIT (EC 2.7.1 .37) (PKA C-BE- 
TA).//8.20E-1 66//240aa//1 00%//P22694 
FCBBF30078600 
FCBBF30079770 
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FCBBF30080730//SPLICING FACTOR, ARGININE/SERINE-RICH 7 (SPLICING FACTOR 9G8).//3.40E-70//136aa// 

95%//Q16629 

FCBBF30081 000 

FCBBF30085560//HYPOTH ETICAL 60.3 KDA PROTEIN R13G10.2 IN CHROMOSOME III.//1 .10E-87//531aa//38%// 
5 Q21988 

FCBBF30088700 
FCBBF30089380 
FCBBF30091010 

FCBBF30091 520//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1,4-ALPHA-GLUCOSIDASE) 
10 (1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE)7/3.70E-09//631 aa//21 %//P08640 

FCBBF30093170//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.10E-63//173aa//65%//P51523 

FCBBF30095410 

FCBBF30099490 

FCBBF30100080//ARF NUCLEOTIDE-BINDING SITE OPENER (ARNO PROTEIN) (ARF EXCHANGE FACTOR).// 
15 1 .1 0E-57//1 08aa//1 00%//Q9941 8 

FCBBF301 00120//Mus musculus semaphorin cytoplasmic domain-associated protein 3A (Semcap3) mRNA, complete 
cds7/2.10E-192%/769aa//50%//AF127084 

FCBBF30100410//Mus musculus testis-specific Y-encoded-iike protein (Tspyll) mRNA, complete cds.//1.90E-56// 
324aa//42%//AF042180 
20 FCBBF30101240 
FCBBF30101300 
FCBBF301 05080 

FCBBF30105440//Rattus norvegicus ion transporter protein (NRITP) mRNA, partial cds.//3.40E-36//82aa//91%// 
AF1 84921 

25 FCBBF30105860//microtubule associated-protein orbit [Drosophila melanogaster].//1 .00E-79//556aa//33%// 
BAA94248 
FCBBF301 06950 

FCBBF30107290//MITOCHONDRIAL PROCESSING PEPTIDASE ALPHA SUBUN IT PRECURSOR (EC 3. 4. 24. 64) 
(ALPHA-MPP) (P-55) (HA1523) (KIAA01 23)7/1 .00E-91//172aa//100%//Q1 071 3 
30 FCBBF30 107330 
FCBBF30114180 

FCBBF30114850//Homo sapiens C2H2 (Kruppel-type) zinc finger protein mRNA, complete cds7/3.20E-24//249aa// 
34%//AF1 59567 
FCBBF301 15230 

35 FCBBF301 1 5920//Homo sapiens nolp mRNA, complete cds.//9.40E-220//257aa//1 00%//AB01 7800 

FCBBF30118670//Homo sapiens disintegrin and metalloproteinase domain 19 (ADAM 19) mRNA, partial cds.//0// 
601 aa//97%//AF1 34707 

FCBBF30118890//Drosophila melanogaster La related protein (larp) mRNA, partial cds.//6.70E-25//221aa//35%// 
AF221108 
40 FCBBF301 25460 

FCBBF30125880//Homo sapiens single-strand selective monofunctional uracil DNA glycosylase mRNA, complete 
cds.//6.40E-81//96aa//1 00%//AF1 251 82 
FCBBF301 28420 

FCBBF301 2901 0//ZINC FINGER PROTEIN 36 (ZINC FINGER PROTEIN KOX1 8) (FRAGMENT).//1 .20E-1 79//322aa// 
45 100%//P17029 

FCBBF301 3041 0//CALDESMON (CDM).//3.30E-06//1 70aa//32%//P 12957 
FCBBF301 30580 

FCBBF30132050//Homo sapiens mRNA for UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta-galactosyltrans- 
ferase7/2.1 0E-43//253aa//36%//Y1 501 4 
50 FCBBF30132660//Drosophila melanogaster Canton S tartan protein (trn) mRNA, complete cds7/2.00E-15//293aa// 
30%//U 02078 

FCBBF30135890//GLUTENIN, LOW MOLECULAR WEIGHT SUBUNIT PRECURSOR.//2.60E-07//163aa//34%// 
P 10385 

FCBBF30136230//NIL-2-A ZINC FINGER PROTEIN (NEGATIVE REGULATOR OF IL2) (TRANSCRIPTION FACTOR 
5 5 8)7/0//1 090aa//94%//P37275 

FCBBF30138000//trg protein - raV/1 .30E-82//560aa//37%//l60486 

FCBBF30142290//dJ127B20.3 (novel PHD finger protein) [Homo Sapiens] .//1.00E-140//287aa//96%//CAB62994 
FCBBF30143550//FYVE FINGER-CONTAINING PHOSPHOINOSITIDE KINASE (EC 2.7.1.68) (1-PHOSPHATIDYLI- 



188 



EP 1 308 459 A2 



NOSITOL^-PHOSPHATE KINASE) (PIP5K) (PTDINS(4)P-5- KINASE) (P235).//0//1027aa//91%//Q9Z1T6 

FCBBF301 45670 

FCBBF30151190 

FCBBF301 531 70//6-PHOSPHOFRUCTOKINASE, LIVER TYPE (EC 2.7.1.11) (PHOSPHOFRUCTOKINASE 1) 
5 (PHOSPHOHEXOKINASE) (PHOSPHOFRUCTO-1 -KINASE ISOZYME B)7/0//670aa//99%//P 17858 
FCBBF30157270//Rattus norvegicus PAPIN mRNA, complete cds.//1.20E-179//639aa//58%//AF1 69411 
FCBBF30161780 

FCBBF301 6451 0//RETINAL-CADHERIN PRECURSOR (R-CADHERIN) (R-CAD).//0//794aa//98%//P55283 
FCBBF30166220//SERINE HYDROXYMETHYLTRANSFERASE, CYTOSOLIC (EC 2.1.2.1) (SERINE METHYLASE) 
10 (GLYCINE HYDROXYMETHYLTRANSFERASE) (SHMT).//2.70E-12//33aa//100%//P34896 

FCBBF30169280//Petuniax hybrida PGPD14 (PGPD14) mRNA, complete cds.//1.40E-62//261aa//42%//AF049930 
FCBBF301 69870 
FCBBF301 70710 

FCBBF301 71 230//N EURO ENDOCRINE CONVERTASE 2 PRECURSOR (EC 3.4.21.94) (NEC 2) (PC2) (PROHOR- 
15 MONE CONVERTASE 2) (PROPROTEIN CONVERTASE 2) (KEX2-LIKE ENDOPROTEASE 2)7/1 .70E-82//1 81 aa// 
86%//P16519 
FCBBF301 72330 

FCBBF30173960//erythroid differentiation-related factor 1 [Homo sapiens].//6.00E-32//113aa//100%//AAC00001 
FCBBF301 75350//MITO GEN -ACTIVATED PROTEIN KINASE 7 (EC 2.7.1 .-) (EXTRACELLULAR SIGNAL- REGULAT- 
20 ED KINASE 5) (ERK5) (ERK4) (BMK1 KINASE).//4.60E-06//245aa//28%//Q13164 

FCBBF30177290//HYPOTHETICAL47.6 KDA PROTEIN C16C10.5 IN CHROMOSOME lll.//2.40E-28//133aa//46%// 
Q09251 

FCBBF30179180 
FCBBF301 79740 
25 FCBBF30181730 
FCBBF301 94370 

FCBBF30194550//ANKYRIN 1 (ERYTHROCYTE ANKYRIN) (ANKYRIN R) (ANKYRINS 2.1 AND 2.2),//9.90E-54// 
478aa//33%//P16157 

FCBBF301 95690//SYNAPTOTAGMIN I.//1 .30E-27//138aa//31%//P34693 
30 FCBBF301 95700 

FCBBF301 97840//Mouse mRNA for seizure-related gene product 6 type 2 precursor, complete cds.//0//869aa//91%// 
D64009 

FCBBF30198670//dof protein - fruit fly (Drosophila melanogaster)//6.60E-05//272aa//24%//T13712 
FCBBF30201 630//Chlamydomonas reinhardtii dhd gene for 1 -alpha dynein heavy chain7/1 .10E-121//384aa//5896// 
35 AJ243806 

FCBBF30212210 

FCBBF30215240//mitogen inducible gene mig-2 - human//2.20E-135//263aa//96%//S69890 

FCBBF30220050//OXYSTEROLS RECEPTOR LXR-BETA (LIVER X RECEPTOR BETA) (NUCLEAR ORPHAN RE- 
CEPTOR LXR-BETA) (UBIQUITOUSLY-EXPRESSED NUCLEAR RECEPTOR) (NUCLEAR RECEPTOR NER).// 
40 6.90E-96//1 67aa//1 00%//P55055 

FCBBF30222910//Mus musculus Rap2 interacting protein 8 (RPIP8) mRNA, complete cds.//4.70E-29//76aa//46%// 
U73941 

FCBBF30223110 

FCBBF30223210//PLEXIN 4 PRECURSOR (TRANSMEMBRANE PROTEIN SEX).//1 .70E-72//179aa//76%//P51805 
45 FCBBF30225930 

FCBBF30228940//Homo sapiens zinc finger protein dp mRNA, complete cds.//2.60E-14//114aa//42%//AF1 53201 
FCBBF30230610 

FCBBF30236670//Homo sapiens DEAD-box protein abstrakt (ABS) mRNA, complete cds.//1 .00E-128//276aa//91%// 
AF195417 

50 FCBBF30250980//GUANINE NUCLEOTIDE-BINDING PROTEIN BETA SUBUNIT-LIKE PROTEIN.//3.20E-06// 
190aa//28%//024076 

FCBBF30255680//Rattus norvegicus brain specific cortactin-binding protein CBP90 mRNA, partial cdsV/1 .80E-275// 
641aa//82%//AF053768 

FCBBF30257370//CARNITINE DEFICIENCY-ASSOCIATED PROTEIN EXPRESSED IN VENTRICLE 1 (CDV-1 PRO- 
55 TEIN).//2.80E-1 69//355aa//92%//035594 

FCBBF30259050//Mus musculus (clone pMLZ-1) zinc finger protein (Zfp) mRNA, 3' end of cds.//1 .40E-241//499aa// 
83%//L36315 

FCBBF30260210//Drosophila melanogaster KISMET-L long isoform (kis) mRNA, complete cds.//3.90E-178//420aa// 
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68%//AF215703 

FCBBF30260480//Mus musculus putative E1-E2 ATPase mRNA, partial cds7/1.80E-78//154aa//95%//AF 156547 
FCBBF30263080//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//4.20E-33//107aa//58%//P51522 
FCBBF30266510 

5 FCBBF30271 990//ANKYRIN 1 (ERYTHROCYTE ANKYRIN).//3.00E-43//41 9aa//33%//Q02357 

FCBBF30275590//dedlcator of cyto-kinesis 1 [Homo Sapiens].//1.00E-138//791aa//37%//NP_001371 
FCBBF30282020//cca3 protein - rat//5.50E-249//492aa//94%//T31081 

FCBBF30285930//ZINC FINGER PROTEIN ZFP-1 (MKR1 PROTEIN)V/5.70E-68//125aa//97%//P08042 
FCBBF30287940 

10 FCBBF4000061 0//late gestation tung 2 protein [Rattus norvegicus].//5.00E-86//1 78aa//94%//AAF44721 
FCBBF40001920 
FCBBF40005000 
FCBBF50000410 
FCBBF50000610 

15 FCBBF50001650//Homo sapiens JP3 mRNA for junctophilintype3, complete cds.//1 .20E-111//407aa//57%//AB042636 
FCBBF50003530//H. sapiens mRNA for dinG gene.//2.70E-1 37//1 81 aa//1 00%//Y1 0571 
FCBBF50004950 

FEBRA20005040//MYOSIN HEAVY CHAIN, STRIATED MUSCLE. //1 .90E-1 3//479aa//23%//P24733 
FEBRA20007820//MLN 64 PROTEIN (CAB1 PROTEIN).//7.00E-31//129aa//51%//Q14849 
20 FEBRA20018670 

FEBRA20026820//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//2.20E-135//431aa//50%// 
Q05481 

FEBRA20027070//ZINC FINGER PROTEIN 41 (FRAGMENT).//1 .00E-139//333aa//70%//P51 814 
FEBRA20029620 

25 FEBRA20031 000//TRICHOHYALIN.//2.20E-1 6//360aa//26%//P37709 

FEBRA20031150//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds.//2.00E-29//63aa//100%//AF226053 

FEBRA20031280 

FEBRA20031810 

FEBRA20035200 

30 FEBRA20035240//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53).//7.30E-05//108aa//31%// 
Q15427 

FEBRA20038220//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 

(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//7.50E-05//256aa//24%//P08640 

FEBRA20038330 

35 FEBRA20038970//Homo sapiens mRNA for stabilin-1 (stabl gene).//1 .30E-242//41 3aa//99%//AJ275213 
FEBRA20039070 

FEBRA20039260//NonF [Streptomyces griseus subsp. griseus]7/2.20E-16//140aa//38%//AAD37457 
FEBRA20040230 
FEBRA20040260 
40 FEBRA20040290 

FEBRA20040560//Homo sapiens delta-6 fatty acid desaturase (CYB5RP) mRNA, complete cds.//1 .70E-112//204aa// 
100%//AF 134404 

FEBRA20045380//EVI-5 homolog [Homo sapiens].//7.00E-49//130aa//81%//AAC 16031 

FEBRA20046200//ANKYRIN 2 (BRAIN ANKYRIN) (ANKYRIN B) (ANKYRIN, NONERYTHROID).//1 .70E-25//368aa// 
45 30%//Q01484 

FEBRA20046280 

FEBRA20046510//ZINC FINGER PROTEIN 135.//1 .10E-94//260aa//62%//P52742 
FEBRA20057010//ZINC FINGER PROTEIN 195.//1 .30E-12//47aa//70°///0 14628 

FEBRA20063720//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//3.10E-243//586aa//73%// 
50 Q03923 

FEBRA20076200 
FEBRA20078180 

FEBRA20078800//NADH-UBIQUINONE OXIDOREDUCTASE 20 KDA SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 1. 
6. 99. 3) (COMPLEX I-20KD) (CI-20KD) (PSST SUBUNIT).//2.30E-96//192aa//96%//075251 
55 FEBRA20080860 
FEBRA20082660 
FEBRA20083410 
FEBRA20084750 
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FEBRA20086600 

FEBRA20087550//damage-specrfic DNA binding protein 2 (48kD) [Homo sapiens]//1 .10E-106//119aa//94%// 
NP_000098 

FEBRA2008861 0//CELLULAR RETINALDEHYDE-BINDING PROTEIN (CRALBP).//2.40E-14//145aa//30%//P10123 
5 FEBRA20088810//FIBR0BL GROWTH FACTOR-17 PRECURSOR (FGF-1 7)7/1 .00E-1 02//1 93aa//99%//060258 

FEBRA200901 60//M ITOG EN -ACTIVATED PROTEIN KINASE KINASE KINASE 10 (EC 2.7.1.-) (MIXED LINEAGE 
KINASE 2) (PROTEIN KINASE MST).//1 .10E-15//111aa//48%//Q02779 

FEBRA20090220//TRANSLATION INITIATION FACTOR EIF-2B EPSILON SUBUNIT(EIF-2BGDP-GTP EXCHANGE 
FACTOR).//0//721 aa//90%//P47823 
10 FEBRA20091620 

FEBRA20092760//PINCH PROTEIN ( PARTI CULARY INTERESTING NEW CYS-HIS PROTEIN).//2.50E-81//165aa// 
80%//P48059 
FEBRA20093270 
FEBRA20093280 
15 FEBRA20095410 
FEBRA20098040 

FEBRA20099860//dynactin 3 (p22); dynactin light chain [Homo Sapiens]//1 .70E-41//89aa//100%//NP_009165 
FEBRA201 01410 
FEBRA201 08020 
20 FEBRA201 08580 
FEBRA201 15930 
FEBRA201 16650 
FEBRA201 21200 

FEBRA20121950//X INACTIVE SPECIFIC TRANSCRIPT PROTEIN (FRAGMENT) 7/1 .60E-07//155aa//24%//P27571 
25 FEBRA20141980 

FEBRA201 50420//HYPOTHETICAL 131 .5 KDA PROTEIN C02F12.7 IN CHROMOSOME X7/6.90E-56//877aa//24%// 
Q11102 

FEBRA20151750//Mus musculus (clone E5.53) Huntington disease (hdh) gene, exon 57/2.60E-12//88aa//43%// 
L34024 
30 FEBRA201 63980 

FEBRA20170240//ZINC FINGER PROTEIN 757/7.90E-158//278aa//99%//P51815 

FEBRA201 72230//Mus musculus Schwann oma-associated protein (SAM9) mRNA, complete cds.//1 .70E-57//295aa// 
40%//AF026124 

FEBRA20173330//PROTEIN KINASE CLK3 (EC 2.7.1 .-) 7/4.80 E-277//490aa//99%//P49761 
35 FEBRA201 75020 
FEBRA201 75330 

FEBRA20177800//RNA binding motif protein 9 [Homo sapiens]7/4.00E-09//75aa//95%//NP_055124 
FEBRA201 80510 
FEBRA201 82030 
40 FEBRA201 87460 

FEBRA2Q191720//REGULATOR OF G-PROTEIN SIGNALING 11 (RGS11).//2.00E-73//104aa//100%//094810 
HCHON1 00001 50//SKELETAL MUSCLE LIM-PROTEIN 1 (SLIM 1)(SLIM) (FOUR AND A HALF LIM DOMAINS PRO- 
TEIN 1) (FHL-1)7/4.10E-74//154aa//84%//Q13642 
HCHON1 0001 660 

45 HCHON20000870//SERINE/THREONINE-PROTEIN KINASE CTR1 (EC 2.7.1 .37)7/2. 10 E-21//300aa//26%//Q0560 9 
HCHON20002650//EARLY GROWTH RESPONSE PROTEIN 2 (EGR-2) (KROX-20 PROTEIN).//9.90E-05//166aa// 
26%//P51774 

HCHON20002710//UBIQUITIN CARBOXYL-TERM INAL HYDROLASE 13 (EC 3.1.2.15) (UBIQUITIN THIOLESTE- 
RASE 13) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 13) (DEUBIQUITINATING ENZYME 13)7/7.40E-10// 
50 114aa//28%//P38187 

HCHON20015050//LEUKOCYTE ADHESION GLYCOPROTEIN P150.95 ALPHA CHAIN PRECURSOR (LEUKO- 
CYTE ADHESION RECEPTOR P1 50,95) (CD11 C) (LEU M5) (INTEGRIN ALPHA-X)7/8.60E-06//250aa//26%//P20702 
HEART10001420//Mus musculus skm-BOP1 (Bop) mRNA, complete cds7/6.10E-259//485aa//94%//U76373 
HEART10001490//ACTIN INTERACTING PROTEIN 27/1.80E-71//243aa//58%//P46681 
55 HEART20009590//Homo sapiens mRNA for paraplegin-like protein.//7.10E-47//145aa//67%//Y1 8314 

HEART20019310//Mus musculus RING-finger protein MURF mRNA, complete cds.//6.70E-118//362aa//61%// 
AF294790 

HEART20022200//METHIONINE AMINOPEPTIDASE 2 (EC 3.4.11.18) (METAP 2) (PEPTIDASE M 2) (INITIATION 
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FACTOR 2 ASSOCIATED 67 KDA GLYCOPROTEIN) (P67).//1 .50E-209//447aa//86%//P50579 
HEART20031680 

HEART20047640//CALCIUM/CALMODULIN-DEPENDENT 3\5'-CYCLIC NUCLEOTIDE PHOSPHODIESTERASE 
1C (EC 3.1.4.17) (CAM-PDE 1C)7/0//769aa//94%//Q63421 
5 HEART20063100//H. sapiens mRNA histone RNA hairpin-binding protein.//5.60E-114/%212aa//100%//Z71188 

HEART20082570//AMINOMETHYLTRANSFERASE PRECURSOR (EC 2.1.2.10) (GLYCINE CLEAVAGE SYSTEM T 

PROTEIN).//5.50E-1 1 3//21 0aa//69%//P28337 

HHDPC10001140 

HHDPC20051850//STEROID RECEPTOR PROTEIN DG6.//9.50E-43//101aa//89%//015173 
10 HHDPC20081230//NUCL (PROTEIN C23).//0//681aa//92%//P 19338 
HHDPC20082790 
HHDPC20082970 
HHDPC20088160 

HLUNG20008460//DIAPHANOUS PROTEIN HOMOLOG 2.//7.60E-33//521 aa//26%//060879 
15 HLUNG20009260 
HLUNG20009550 
HLUNG20010130 

HLUNG20011260//TYROSINE-PROTEIN KINASE SRC-1 (EC 2.7.1.112) (P60-SRC-1)V/1.10E-46//92aa//100%// 
P13115 
20 HLUNG20011440 

HLUNG20011460//Rattus norvegicus serine-arginine-rich splicing regulatory protein SRRP86 mRNA, complete cds.// 

1 .20E-159//398aa//79%//AF234765 

HLUNG20012140 

HLUNG20014590//ZINC FINGER PROTEIN 1357/1 ,20E-122//350aa//59%//P52742 
25 HLUNG20015070//SLIT PROTEIN PRECURSOR.//5.00E-14//167aa//33%//P24014 

HLUNG20015180//BALBIANI RING PROTEIN 3 PRECURSOR. //8. 80 E-08//444aa//24%//Q 03376 
HLUNG20020500 

HLUNG20020850/7TLM PROTEIN (TLM ONCOGENE).//5.00E-17//91aa//54%//P17408 
HLUNG20021450 
30 HLUNG20023030 
HLUNG20024050 
HLUNG20025620 

HLUNG20028110//zinc finger protein - fission yeast (Schizosaccharomyces pombe).//2,70E-23//140aa//38%//T39456 
HLUNG20029420 
35 HLUNG20029490 

HLUNG20030420//Mus musculus mRNA for MAIL, complete cds.//1 .00E-164//728aa//68%//AB02O974 
HLUNG20030490//Ambystomatigrinum RPE65 protein mRNA, complete cdsy/1.70E-64//335aa//42%//AF047465 
HLUNG20030610 
HLUNG20031620 

40 HLUNG20032460//LYSOSOMAL PRO-X CARBOXYPEPTIDASE PRECURSOR (EC 3.4.16.2) (PROLYLCARBOX- 
Y PEPTIDASE) (PRCP) (PROLINE CARBOXYPEPTIDASE) (ANGIOTENSINASE C) (LYSOSOMAL CARBOX- 
YPEPTIDASE C).//8.60E-274//440aa//99%//P42785 

HLUNG20033060//Homo sapiens GAP-like protein (N61) mRNA, complete cds.//1 .20E-81//389aa//45%//AF251038 
HLUNG20033310 
45 HLUNG20033350 
HLUNG20034970 
HLUNG20037140 

HLUNG20037160//RETRO VIRUS-RELATED ENV POLYPROTEIN.//1 .90E-131//439aa//54%//P10267 
HLUNG20037780 
50 HLUNG20038330 

HLUNG20041 540//GLU CO AMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//8.80E-08//286aa//24%//P08640 

HLUNG20041590//ubiquitous tetratricopeptide containing protein RoXaN [Homo sapiens].//1 .00E-158//737aa//42%// 
AAF05541 

55 HLUNG20042730//CYTOCHROME P450 4A4 (EC 1 .14.14.1) (CYPIVA4) (PROSTAGLANDIN OMEGA- HYDROXY- 
LASE) (P450-P-2) .//4.90 E-1 26//442aa//49%//P 1 061 1 

HLUNG20045340//MOB2 PROTEIN (MPS1 BINDER 2).//4.60E-27//135aa//37%//P43563 
HLUNG20047070 
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HLUNG20050760 
HLUNG20051330 

HLUNG20052300//AIG1 PROTEIN.//3.00E-23//21 6aa//30%//P54120 

HLUNG20054790//PHOSPHATIDYLINOSITOL 3-KINASE REGULATORY SUSUNIT (EC 2.7.1.137) (IB PI3-KINASE 
5 P101 SUBUNIT) (PTDINS-3-KINASE P101) (PI3K) (P101-PI3K).//8.30E-22//292aa//25%//002696 

HLUNG20055240 

HLUNG20056560 

HLUNG20057380 

HLUNG20059240 
10 HLUNG20060670 

HLUNG20063700//H.sapiens PEBP2aC1 acute myeloid leukaemia mRNA.//3.60E-22//62aa//79%//Z35278 
HLUNG20065700 

HLUNG20065990//SYNTAXIN4.//8.80E-127//267aa//96%//Q12846 
HLUNG20067810 

15 HLUNG20068120//NUCLEAR TRANSITION PROTEIN 2 (TP-2)7/7.10E-06//86aa//38%//P11101 
HLUNG20069350//CALCYPHOSINE7/1 .80E-13//128aa//31%//Q 13938 
HLUNG20070410 

HLUNG20072100//Gallus gallus Dach2 protein (Dach2) mRNA, complete cds.//1 .40E-236//404aa//79%//AF1 98349 
HLUNG20072190 
20 HLUNG20072450 
HLUNG20074330 
HLUNG20079310 

HLUNG20081390//DNAJ PROTEIN.//1 .60E-1 7//98aa//47%//P35515 

HLUNG20081530//N EURO FILAMENT TRIPLET H PROTEIN (200 KDA NEUROFILAMENT PROTEIN) (NF-H).// 
25 1 .90E-09//220aa//25%//P1 2036 

HLUNG20082350//Homo sapiens goodpasture antigen-binding protein (COL4A3BP) mRNA, complete cds.//0//399aa// 
93%//AF1 36450 

HLUNG20083330//alphal (III) collagen [Homo Sapiens]//5.40E-61//113aa//99%//CAA29886 

HLUNG20083480//Chicken mRNA for TSC-22 variant, complete cds, clone SLFEST52.//9.00E-178//527aa//68%// 
30 D82364 

HLUNG20083840 
HLUNG20083960 

H LU NG20084790//H YPOTH ETI C AL 65.2 KDA TRP-ASP REPEATS CONTAINING PROTEIN D2030.9 IN CHROMO- 
SOME l7/5.50E-47//161aa//53%//P90794 

35 HLUNG2008521 0//Homo sapiens intersectin 2 (SH3D1 B) mRNA, complete cds7/9.60E-28/1 62aa//95%//AF248540 
HLUNG20088750 
HLUNG20092530 
HLUNG20093030 
HLUNG20094130 

40 KIDNE20011600 

KIDNE20016360//Rattus norvegicus potassium channel (erg2) mRNA, complete cds7/0//418aa//96%//AF016192 

KIDNE20024380 

KIDNE20027980 

KIDNE20080690//PROBABLE AMINOTRANSFERASE T01 B11 .2 (EC 2.6.1 .-).//5.80E-11 4//445aa//48%//P9 1408 
45 KIDNE20081170//Homo sapiens microtubule-based motor (HsKIFC3) mRNA, complete cds.//6.20E-153//216aa// 
99%//AF004426 
KIDNE20083150 

K I D N E20083620//L- AS PA RAG I N AS E (EC 3.5.1 .1) (L-ASPARAGINE AM I DO HYDROLASE) 7/2. 60 E-53//148aa//43%// 
Q9ZSD6 
50 KIDNE20084030 

KIDNE20084040//PHOSPHOLIPASE D1 (EC 3.1 .4.4) (PLD 1) (CHOLINE PHOSPHATASE 1) (PHOSPHATIDYLCHO- 
LINE-HYDROLYZING PHOSPHOLIPASE D1).//3.30E-70//134aa//100%//Q13393 

KIDNE20084730//Homo sapiens FH1/FH2 domain-containing protein FHOS (FHOS) mRNA, complete cds.//2.60E- 
1 48//599aa//52%//AF1 1 361 5 
55 KIDNE20084800 
KIDNE20086490 
KIDNE20086660 
KIDNE20086970 



193 



EP 1 308 459 A2 



KIDNE20087880 

KIDNE20088240//atopy related autoantigen CALC [Homo sapiens].//1 .00E-26//300aa//26%//CAA76830 
KIDNE20089870//HISTONEACETYLTRANSFERASE TYPE BSUBUNIT2 (RETINOBLASTOMA BINDING PROTEIN 
P46) (RETINOBLASTOMA-BINDING PROTEIN 7).//4.30E-237//422aa//99%//Q16576 
5 KIDNE20091090 
KIDNE20094260 

K!DNE20094670//Drosophila melanogaster AAA family protein Bor (bor) mRNA, complete cds.//2.10E-124//399aa// 

59%//AF227209 

KIDNE20095530 

10 KIDNE20133460//Homo sapiens mRNA for sperm protein.//3.40E-146//284aa//100%//X91879 
KIDNE20133880 
KIDNE20134130 

KIDNE20134890//CENTROMERIC PROTEIN E (CENP-E PROTEIN).//7.70E-05//169aa//21%//Q02224 
KIDNE20137310 
15 KIDNE201 38450 

KIDNE20140870//zinc finger protein 106 [Mus musculus]//2.10E-288//822aa//67%//AF060246 
KIDNE20141120 

KIDNE20141700//40S RIBOSOMAL PROTEIN S4, X ISOFORM (SINGLE COPY ABUNDANT MRNA PROTEIN) 
(SCR1 0)7/2.60 E-72//1 53aa//89%//P1 2750 
20 KIDNE20142680 

KIDNE20142900//THROMBOMODULIN PRECURSOR (FETOMODULIN) (TM) (CD141 ANTIGEN).//1 .80E-71// 

119aa//100%//P07204 

KIDNE201 43200 

KIDNE20147170//acetylglutamate synthase - fission yeast (Schizosaccharomyces pombe)//8.40E-15//143aa//37%// 
25 T40666 

KIDNE201 48080 

KIDNE20149780//NG28 [Mus musculus]//3.50E-66//367aa//44%//AAC97966 

KIDNE20150730//REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1).//2.40E-06//84aa//41%//P49646 
KIDNE20152440//Homo sapiens mRNA for serin protease with IGF-binding motif, complete cds.//1 .80E-1 81//388aa// 
30 93%//D87258 

KIDNE20154330//Rattus norvegicus mRNA for multi PDZ domain protein y/0//763aa//87%//AJ00 1320 

KIDNE20154830 

KIDNE20155980 

KIDNE20157100 

35 KIDNE20160360//ARF NUCLEOTIDE-BINDING SITE OPENER (ARNO PROTEIN) (ARF EXCHANGE FACTOR).// 
7. 1 0E-40//1 94aa//41 %//Q9941 8 
KIDNE201 60960 
KIDNE201 63710 

KIDNE20165390//Homo sapiens mRNA for beta-tubulin folding cofactor D.//0//709aa//94%//AJ006417 
40 KIDNE20169180//UROMODULIN PRECURSOR (TAMM-HORSFALL URINARY GLYCOPROTEIN) (THP).//0// 
615aa//99%//P07911 
KIDNE201 70400 

KIDNE20173150//Bos taurus mRNA for mitochondrial aralkyl acylCoA:amino acid N-acyftransferase.//2.90E-53// 
277aa//40%//AJ223301 

^5 KIDNE20173430//Homo sapiens PDZ domain containing-protein (PDZK1) mRNA, complete cds.//7.90E-28//150aa// 
34%//AF012281 
KIDNE201 76030 
KIDNE20181670 
KIDNE201 82540 

50 KIDNE20186170//UDP-GLUCURONOSYLTRANSFERASE 2B13 PRECURSOR, MICROSOMAL (EC 2.4.1.17) (UD- 
PGT) (EGT10).//4.40E-38//214aa//39%//P36512 
KIDNE201 88630 

KIDNE20189890//Homo sapiens mRNA for KARP-1 -binding protein 2 (KAB2), complete cds.//6.00E-30//1 77aa//44%// 
AB022658 

55 KIDNE20189960//TREHALASE PRECURSOR (EC 3.2.1 .28) (ALPHA, ALPHA-TREHALASE) (ALPHA, ALPHA-TRE- 
HALOSE GLUCOHYDROLASE)7/1 .40E-224//421aa//97%//043280 
KIDNE20191870 

LIVER20006260//Mus musculus zinc finger protein ZFP113 mRNA, complete cds 7/4.50 E-183//385aa//8596// 
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AF1 67320 
LIVER20007690 
UVER20007750 
LIVER20010510 

5 LIVER20010760//Homo sapiens C-type lectin-like receptor-1 mRNA, complete cds.//3, 10E-134//208aa//100%// 
AF200949 

LIVER20010990//Rattus norvegicus mRNA for putative integral membrane transport protein (UST1 r).//7. 00E-52// 
196aa//54%//Y09945 

LIVER20011640//Human proline rich calmodul in -dependent protein kinase mRNA, complete cds.//2.00E-116//221aa// 
10 97%//U23460 
LIVER20013890 

LIVER20026440//CYTOCHROME P450 4F3 (EC 1.14.13.30) (CYPIVF3) (LEUKOTRIENE-B4 OMEGA- HYDROXY- 
LASE) (LEUKOTRIENE-B4 20-MONOOXYGENASE) (CYTOCHROME P450- LTB-OMEGA).//2.60E-136//295aa// 
84%//Q08477 

15 LIVER20030650//WHITE PROTEIN.//7.20E-09//229aa//25%//Q05360 
LIVER20032340 

LIVER20038000//MITOCHONDRIAL CARNITINE/ACYLCARNITINE CARRIER PROTEIN (C ARN ITI N E/AC YLCAR- 
NITINE TRANSLOCASE) (CAC)7/9.40E-40//148aa//38%//P97521 

LIVER20040740//RETINAL-BINDING PROTEIN (RALBP)7/3.10E-60//337aa//37%//P49193 
20 LIVER20055270//SELENIDE, WATER Dl KINASE 2 (EC 2.7.9.3) (SELENOPHOSPHATE SYNTHETASE 2) (SELENI- 
UM DONOR PROTEIN 2).//2.70E-204//376aa//97%//Q99611 

MESAN20006200//ANNEXIN II (LIPOCORTIN II) (CALPACTIN I HEAVY CHAIN) (CHROMOBINDIN 8) (P36) (PRO- 
TEIN I) (PLACENTAL ANTICOAGULANT PROTEIN IV) (PAP-IV).//1 .70E-84//1 74aa//95%//P07355 
MESAN20007110 
25 MESAN20008150 
MESAN20008940 

MESAN20009090//Homo sapiens CEGP1 protein (CEGP1), mRNA//1 .10E-1 79//553aa//58%//NM_020974 
MESAN20016270//ZINC FINGER PROTEIN 37A (ZINC FINGER PROTEIN KOX21) (FRAGMENT) 7/1 .60E-141// 
242aa//100%//P17032 

30 MESAN20021130//Human SH3 domain-containing proline-rich kinase (sprk) mRNA, complete cds7/8.20E-168// 
346aa//91%//U07747 

MESAN20021220//INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN COMPLEX ACID LABILE CHAIN PRE- 
CURSOR (ALS)7/7.40E-26//1 97aa//30%//002833 

MESAN20021470//SPINDLIN (30000 MR METAPHASE COMPLEX) (SSEC P).//3.50E-123//229aa//98%//Q61142 
35 MESAN20021860 
MESAN20026870 

MESAN20027240//Rho guanine nucleotide exchange factor (GEF) 10 [Homo sapiens].//1.00E-134//620aa//40%// 
NP_055444 

M ESAN20027900//COLLAG EN ALPHA 3(VI) CHAIN PRECURSOR//0//1001aa//98%//P12111 
40 MESAN20029780 

MESAN20030350//Mus musculus diaphanous-related formin (Dia2) mRNA, complete cds.//6.60E-301//669aa//84%// 

AF094519 

MESAN20030370 

MESAN20030390 

45 MESAN20033220//ALDEHYDE DEHYDROGENASE 7 (EC 1 .2.1 .5)7/1 .60E-24//54aa//1 00%//P43353 

MESAN20034440//39.1 KDA PROTEIN IN SURE-CYSC INTERGENIC REGION.//2.70E-07//11 7aa//31%//Q57261 
M ES AN20038520//D N A-D I RECTE D RNA POLYMERASE ill 128 KDA POLYPEPTIDE (EC 2.7.7.6) (RNA POLYMER- 
ASE III SUBUNIT 2)7/0//831aa//70%//P25167 
MESAN20041380 

50 MESAN20045750 

MESAN20056890//SPLICING FACTOR, ARGININE/SERINE-RICH 2 (SPLICING FACTOR SC35) (SC-35) (SPLICING 
COMPONENT, 35 KDA) (PR264 PROTEIN).//3.30E-12//97aa//48%//Q01130 
MESAN20057240//DNA EXCISION REPAIR PROTEIN ERCC-1.//5.90E-120//195aa//9896//P07992 
MESAN20058110//65 KDA FK506-BINDING PROTEIN PRECURSOR (EC 5.2.1.8) (FKBP65) (FKBPRP) (PEPTI- 
55 DYL-PROLYL CIS-TRANS ISOMERASE) (PPIASE) (ROTAMASE) (IMMUNOPHILIN FKBP65)7/8.00E-117//229aa// 
89%//Q61576 

MESAN20059570//Rattus norvegicus mRNA for seven transmembrane receptor, complete cds.//1 .00E-173//484aa// 
63%//BAA82518 
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MESAN20060220 
MESAN20060430 

MESAN20065990//Human protein serine/threonine kinase stk2 mRNA, complete cds 7/2. 40 E-07//65aa//50%//L20321 
MESAN20067430//TROPOMYOSIN, FIBROBLAST ISOFORM TM3.//1 .80E-39//87aa//100%//P09494 
5 MESAN20069530//LIM domain only 7 isoform c [Homo Sapiens]//2.20E-286//545aa//99%//NP_056667 

MESAN20084150//Mus musculus secretory carrier membrane protein 4 mRNA, complete cds.//2.20E-48//128aa// 

72%//AF224721 

MESAN20085360 

MESAN20089260 

10 MESAN20090190//CEGP1 protein [Homo sapiens]7/0//880aa//57%//NP_066025 
MESAN20094180 
MESAN20095220 

M ESAN20095800//P EPTI DYL- P RO LYL CIS-TRANS ISOMERASE 1 0 (EC 5.2.1 .8) (PPIASE) (ROTAMASE) (CYCLO- 
PHILIN-10).//1.60E-31//150aa//46%//P52017 
15 NESOP20004520//LYMPHOCYTE-SPECIFIC PROTEIN LSP1 (PP52 PROTEIN) (52 KDA PHOSPHOPROTEIN) 
(LYMPHOCYTE-SPECIFIC ANTIGEN WP34).//3.40E-173//321aa//99%//P33241 
NESOP20005040 
NT2NE20018740 

NT2NE20018890//Homo sapiens WD-repeat protein 6 (WDR6) mRNA, complete cds.//6.60E-184//257aa//99%// 
20 AF099100 

NT2NE20021 860//Lytechinus variegatus embryonic blastocoelar extracellular matrix protein precursor (ECM3) mRNA, 
complete cds7/9.00E-68//466aa//36%//AF287478 

NT2NE20026200//TRANSKETOLASE (EC 2.2.1.1) (TK).//1 .80E-160//310aa//99%//P29401 
NT2NE20026510//basic protein, cytosolic - fruit fly (Drosophila melanogaster)//6.10E-35//202aa//41%//S47857 
25 NT2NE20028700 
NT2NE20033150 

NT2NE20037050//U2 SMALL NUCLEAR RIBONUCLEOPROTEIN AUXILIARY FACTOR 35 KDA SUBUNIT RELAT- 
ED-PROTEIN 2.//7.20E-08//109aa//38%//Q15696 

NT2NE20038870//ZINC FINGER X-LINKED PROTEIN ZXDA (FRAGMENT).//5.90E-153//405aa//72%//P981 68 
30 NT2NE20039210 

NT2NE20042550//ADENYLATE KINASE, CHLOROPLAST (EC 2.7.4.3) (ATP-AMP TRANSPHOSPHORYLASE).// 
7.80E-1 5//1 53aa//28%//P431 88 
NT2NE20045190 
NT2NE20047870 
35 NT2NE20053230 

NT2NE20053950//ZINC FINGER PROTEIN 1 367/6. 40E-108//284aa//64%//P52737 
NT2NE20059210 

NT2NE20059680//Homo sapiens integrin cytoplasmic domain associated protein (lcap-1a) mRNA, complete cds7/ 
1 .80E-44//96aa//1 00%//AF01 2023 
40 NT2NE20060750//ZINC FINGER PROTEIN ZFP-36 (FRAGMENT).//! .20E-69//198aa//68%//P16415 
NT2NE20061030//ZINC FINGER PROTEIN 1657/3. 00 E-39//1 25aa//65%//P499 10 
NT2NE20062880 

NT2NE20064780//A-AGGLUTININ ATTACHMENT SUBUNIT PRECURSOR.//4.10E-05//443aa//24%//P32323 
NT2NE20066590 
45 NT2NE20069580 
NT2NE20070520 
NT2NE20073650 

NT2NE20077250//Homo sapiens cell cycle progression 2 protein (CPR2) mRNA, complete cds.//1 .20E-1 73//349aa// 
94%//AF011792 
50 NT2NE20077270 
NT2NE20077860 

NT2NE20079670//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2)7/2.60E-98//345aa//47%//P51523 

NT2NE20080770 

NT2NE20082130 

55 NT2NE20082600//Homo sapiens zinc finger protein dp mRNA, complete cds.//2.10E-19//163aa//42%//AF1 53201 
NT2NE20086070 

NT2NE20087270//Homo sapiens putative RNA binding protein mRNA, alternatively spliced, complete cds.//4.30E-1 4// 
221aa//29%//AF119121 
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NT2NE20087850//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT).//! .80E-08// 

75aa//40%//P40603 

NT2NE20088030 

NT2NE20092950 

5 NT2NE20095230//Homo sapiens HSKM-B (HSKM-B) mRNA, complete cds.//1 .40E-09//112aa//32%//AF226053 
NT2NE201 04000 
NT2NE201 07810 

NT2N E201 08420//KES 1 P ROTE IN.//4.70E-25//3 1 2aa//3 1 %//P35844 
NT2NE2011119Q//C-TERMINAL BINDING PROTEIN 2.//9.00E-54//137aa//84%//P56545 
10 NT2NE201 12210 
NT2NE201 14850 

NT2NE20117580//NADH-UBIQUINONE OXIDOREDUCTASE 20 KDA SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 
1 .6.99.3) (COMPLEX I-20KD) (CI-20KD) (PSST SUBUNIT).//1 .20E-76//153aa//9596//075251 

NT2NE20119980//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1, ENDOTHELIAL (B12 PROTEIN):// 
15 6.1 0E-48//1 35aa//74%//Q1 3829 
NT2NE201 23610 
NT2NE201 24570 
NT2NE201 26030 

NT2NE201 27900//Homo sapiens myo-lnositol 1 -phosphate synthase A1 (ISYNA1 ) mRNA, complete cds7/1 .40E-269// 
20 465aa//99%//AF220530 

NT2NE20140130//SEMAPHORIN 3B PRECURSOR (SEMAPHORIN V) (SEMA V).//1 .90E-42//90aa//100%//Q 13214 
NT2NE201 40280 

NT2NE20141040//DOWN SYNDROME CRITICAL REGION PROTEIN 1 .//1.80E-105//197aa//99%//P53805 
NT2NE20145250//SCG10 PROTEIN (SUPERIOR CERVICAL GANGLION-10 PROTEIN).//2.50E-58//141aa//88%// 
25 P55821 

NT2NE20146510//HYPOTHETICAL 104.7 KDA PROTEIN F23F12.8 IN CHROMOSOME III PRECURSOR.//6.20E- 
08//1 79aa//23%//P46504 
NT2NE201 48690 
NT2NE201 49500 
30 NT2NE201 50610 
NT2NE201 52620 

NT2NE20153620//mitogen inducible gene mig-2 - human//7.90E-147//449aa//63%//S69890 
NT2NE20155650//RETROVIRUS-RELATED ENV POLYPROTEIN.//2.30E-29//114aa//33%//P10267 
NT2NE20157120 
35 NT2NE20165190 

NT2NE201 67660//Mus musculus nuclear localization signal binding protein (spoM ) mRNA, complete cds.//3.50E-09// 
76aa//40%//S79410 

NT2NE20173970//Rattus norvegicus beta-catenin binding protein mRNA, complete cds.//1.60E-24//134aa//46%// 
AF1 69825 

*o NT2NE20177210//Leishmania major partial ppgl gene for proteophosphoglycan.//1 .10E-06//169aa//27%//AJ243460 

NT2NE20181760 

NT2NE20181800 

NT2NE201 84720 

NT2RI20016240 
45 NT2RI20021200 

NT2RI20033920 

NT2RI20093010//BIFUNCTIONAL METHYLENETETRAHYDRO FOLATE DEHYDROGENASE/CYCLOHYDROLA- 
SE, MITOCHONDRIAL PRECURSOR [INCLUDES: NAD-DEPENDENT METHYLENETETRAHYDROFOLATE DEHY- 
DROGENASE (EC 1.5.1.15); METHENYLTETRAHYDROFOLATE CYCLOHYDROLASE (EC 3.5.4.9)] 7/5. 90 E-34// 
50 86aa//81%//P13995 

NT2RP70001120//GA BINDING PROTEIN BETA-1 CHAIN (GABP-BETA-1 SUBUNIT) (GABPB1).//2.50E-113// 
384aa//62%//Q00420 

NT2RP70001730//Mus musculus actin-binding protein (ENC-1) mRNA, complete cds.//4.30E-249//589aa//74%// 
U 65079 

55 NT2RP70003110//ELASTIN PRECURSOR (TROPOELASTIN).//1 .40E-165//613aa//61%//P15502 
NT2RP70012830//CALPHOTIN.//7.90E-1 7//445aa//28%//Q0291 0 
NT2RP70022820 

NT2RP70027790//RAS SUPPRESSOR PROTEIN 1 (RSU-1) (RSP-1 PROTEIN) (RSP-1).//3.80E-23//186aa//34%// 
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Q01730 

NT2RP70029780//ZINC FINGER PROTEIN ZFP-36 ( FRAGMENT) .//4.30E-118//381aa//56%//P1 641 5 
NT2RP70030840//Mus musculus schlafen3 (S1fn3) mRNA, complete cds7/3.70E-55//328aa//33%//AF099974 
NT2RP70031070//36 KDA NUCLEOLAR PROTEIN HNP36 (DELAYED-EARLY RESPONSE PROTEIN 12) (DER12).// 
5 1.20E-23//169aa//34%//Q61672 
NT2RP70031340 
NT2RP70031480 

NT2RP70035110//Caenorhabditis elegans UNC-89 (unc-89) gene, complete cds.//1 .50E-07//229aa//26%//U33058 
NT2RP7004641 0//BASONUCLIN7/3.60E-71//31 8aa//43%//Q01 954 
10 NT2RP70049610 
NT2RP70056290 

NT2RP70056690//F-SPONDIN PRECURSOR.//2.20E-15//366aa//24%//P35447 
NT2RP70057500//Hypothetical zinc finger-like protein [Homo sapiens]7/0//799aa//94%//AAF88107 
NT2RP70064570//CALPAIN P94, LARGE [CATALYTIC] SUBUNIT (EC 3.4.22.17) (CALCIUM- ACTIVATED NEUTRAL 
15 PROTEINASE) (CANP) (P94 PROTEIN) (MUSCLE-SPECIFIC CALCIUM-ACTIVATED NEUTRAL PROTEASE 3 
LARGE SUBUNIT).//9.40E-86//278aa//40%//Q64691 
NT2RP70074800 

NT2RP70075300//ZINC FINGER PROTEIN 211 (ZINC FINGER PROTEIN C2H2-25).//9.60E-121//333aa//63%// 
Q 13398 

20 NT2RP70075800//HYPOTHETICAL 43.1 KDA TRP-ASP REPEATS CONTAINING PROTEIN K04G11.4 IN CHRO- 
MOSOME X.//1 .80E-1 3//244aa//25%//Q93847 
NT2RP70080150 
NT2RP70084540 

NT2RP70087140//SKIN SECRETORY PROTEIN XP2 PRECURSOR (APEG PROTEIN).//1 .40E-11//264aa//31%// 
25 P17437 

NT2RP70090870//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//4.20E-230//592aa//61%// 
Q05481 

NTONG20002230//Mus musculus RW1 protein mRNA, complete cds 7/5. 20 E-97//546aa//34%//AF0 60565 
NTONG20005310 
30 NTONG20017620 

NTONG20029850//CALCYPHOSINE (R2D5 ANTIGEN).//1 .60E-24//183aa//32%//P41150 

NTONG20031 580//heat shock 27kD protein family, member 7 (cardiovascular); cardiovascular heat shock protein [Ho- 
mo sapiens]//6.1 0E-69//141 aa//95%//NP_055239 

NTONG20032100//KERATIN, TYPE I CYTOSKELETAL 13 (CYTOKERATIN 13) (K13) (CK 13)7/4.20E-175//351aa// 
35 96%//P13646 

NTONG20034540//CGMP-DEPENDENT 3\5'-CYCLIC PHOSPHODIESTERASE (EC 3. 1.4. 17) (CYCLIC GMP 
STIMULATED PHOSPHODIESTERASE) (CGS-PDE)7/0//713aa//99%//000408 
NTONG20035150//RING CANAL PROTEIN (KELCH PROTEIN)7/9.10E-30//570aa//25%//Q04652 
NTONG20043080//MYOSIN LIGHT CHAIN KINASE, SMOOTH MUSCLE AND NON-MUSCLE ISOZYMES (EC 
40 2.7.1.117) (MLCK) [CONTAINS: TELOKIN]7/4.30E-12//226aa//28%//P11799 

NTONG20048440//P116 RH0- INTER ACTING PROTEIN (P116RIP) (RIP3).//1 .60E-269//588aa//87%//P97434 
NTONG20049180 

NTONG20053630//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE)7/1 .30E-12//247aa//29%//P08640 
45 NTONG20053730//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 64E (EC 3.1 .2.15) (UBIQUITIN THIOLESTE- 
RASE 64E) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 64E) (DEUBIQUITINATING ENZYME 64E)7/8.70E- 
1 04//340aa//60%//Q24574 

NTONG20053910//Bos taurus differentiation enhancing factor 1 (DEF-1) mRNA, complete cds.//2.50E-198//891aa// 
4596//AF1 12886 

50 NTONG20055200//ELONGATION FACTOR G, MITOCHONDRIAL PRECURSOR (MEF-G)7/1 .10E-269//522aa// 
83%//Q07803 

NTONG2005801 0//ACETYL-CO ENZYME A SYNTHETASE (EC 6.2.1 .1 ) (ACETATE-COA LIGASE) (ACYL- ACTIVAT- 
ING ENZYME).//3.90E-124//398aa//54%//068040 

NTONG20058220//Homo sapiens phosphoprotein pp75 mRNA, partial cds.//8.60E-14//349aa//25%//AF1 53085 
55 OCBBF20000740//Homo sapiens mRNA for ISLR, complete cds.//3.60E-87//287aa//59%//AB0031 84 
OCBBF20001780 

OCBBF20005220//Rattus norvegicus Fos-reiated antigen mRNA, complete cds.//3.40E-233//552aa//84%//U34932 
OCBBF20009820 
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OCBBF20011860//Mus musculus epithelial protein lost in neoplasm-a (Eplin) mRNA, complete cds.//4.30E-33//98aa// 
66%//AF307844 

OCBBF20012520//Homo sapiens mRNA for ISLR, complete cds.//6.50E-88//287aa//59%//AB003184 
OCBBF20016390 

5 OCBBF2001 681 0//enhancer of polycomb [Mus musculus]//0//743aa//92%//AF079765 
OCBBF201 09450 
OCBBF201 09780 

OCBBF20110210//KILON PROTEIN PRECURSOR (KINDRED OF IGLON).//4.30E-182//352aa//94%//Q9ZOJ8 
OCBBF201 10730 

10 OCBBF20111370//BCL2/ADENOVIRUS E1B 19-KDA PROTEIN-INTERACTING PROTEIN 2.//5.70E-84//324aa// 
52%//054940 

OCBBF20111600//69 KDA ISLET CELL AUTO ANTIGEN (ICA69) (ISLET CELL AUTOANTIGEN 1)7/1 .30E-113// 
335aa//57%//Q05084 

OCBBF20112280//Mesembryanthemum crystallinum phosphoenolpyruvate/phosphate translocator precursor (PPT) 
15 mRNA, complete cds7/7.00E-09//1 1 3aa//30%//AF223359 
OCBBF201 12320 
OCBBF20113110 
OCBBF201 15360 

OCBBF20116250//Mus musculus C2H2-type zinc finger protein (Evi9) mRNA, complete cds7/0//592aa//99%// 
20 AF051525 

OCBBF201 17220 
OCBBF201 18720 
OCBBF20119810 

OCBBF20120010//Homo sapiens zinc metalloprotease ADAMTS6 (ADAMTS6) mRNA, complete cds.//1 .60E-44// 

25 21 7aa//42%//AF1 40674 

OCBBF20120950//ZINC FINGER PROTEIN 151 (POLYOMAVIRUS LATE INITIATOR PROMOTER BINDING PRO- 
TEIN) (LP-1) (ZINC FINGER PROTEIN Z1 3)7/1 . 00 E-28//201aa//34%//Q 60821 
OCBBF20121910//LAF-4 PROTEIN (LYMPHOID NUCLEAR P ROTE IN)7/0//740aa//98%//P5 1826 
OCBBF201 23200 

30 OCBBF201 42290 
OCBBF201 47070 
OCBBF201 52330 
OCBBF201 55030 

OCBBF20156450//ZINC FINGER PROTEIN 757/8.20E-163//289aa//99%//P51815 

35 OCBBF20157970//ZINC FINGER PROTEIN 1357/2.80E-98//306aa//56%//P52742 

OCBBF20160380//liver stage antigen LSA-1 - Plasmodium falciparunV/1 .10E-21//938aa//24%//A45592 
OCBBF20165900//CELL SURFACE ANTIGEN 114/A10 P RECU RSO R7/4. 80 E-09//1 45aa//3 1 %//P 1 9467 
OCBBF20165910//Mus musculus pecanex 1 mRNA, complete cds7/4.20E-116//407aa//53%//AF096286 
OCBBF20166890//RESTIN (CYTOPLASMIC LINKER PROTEIN-170) (CLIP-170)7/4.20E-21//124aa//45%//042184 

40 OCBBF20166900//ZINC FINGER PROTEIN CLONE 647 (FRAGMENT)7/9.00E-18//196aa//30%//P15622 

OCBBF20167290//probable acyl-CoA dehydrogenase - Deinococcus radiodurans (strain R1)//2.50E-72//222aa//60%// 
D75616 

OCBBF20170350//Mus musculus mRNA for GATS protein7/2.50E-56//121aa//96%//AJ296173 
OCBBF20174580//G2/MITOTIC-SPECIFIC CYCLIN S13-7 (B-LIKE CYCLIN) (FRAGMENT)7/7.10E-1 6//240aa//25%// 
45 P25012 

OCBBF20174890//ankyrin 3, long splice form - human//1 .10E-150//178aa//100%//A55575 

OCBBF20175360//Homo sapiens C2H2 (Kruppel-type) zinc finger protein mRNA, complete cds7/3.80E-11//101aa// 
36%//AF1 59567 
OCBBF201 76650 

50 OCBBF201 77540//ZINC FINGER PROTEIN MLZ-4 (ZINC FINGER PROTEIN 46)7/1 .40E-11 0//223aa//86%//Q 03309 
OCBBF201 77910 

OCBBF20182060//Mesocricetus auratus guanine nucleotide-binding protein beta 5 (Gnb5) mRNA, complete cds7/ 
7.20E-82//265aa//61 %//U 1 31 52 
OCBBF201 85630 
55 OCBBF201 88280 

OCBBF20191950/A/ERY LOW-DENSITY LIPOPROTEIN RECEPTOR PRECURSOR (VLDL RECEPTOR) 7/0// 
720aa//97%//P98155 

PANCR1 00008 60//ELASTASE 1MB PRECURSOR (EC 3.4.21.70) (PROTEASE E)7/1 .10E-52//87aa//97%//P08861 
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PEBLM1 0001470//glutamine (Q)-rich factor 1 , QRF-1 [mice, B-cell leukemia, BCL1 , Peptide Partial, 84 aa]//5.00E-42// 
84aa//98%//AAB29272 

PEBLM20001800//IG ALPHA-1 CHAIN C REGION7/7.90E-197//353aa//100%//P01876 

PEBLM20003260//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1)7/4.90E-62//151aa//70%//P51522 
5 PEBLM20005020 

P LAC E5000 1 290//H Y POTH ETIC AL 87.9 KDA PROTEIN F44G4.1 IN CHROMOSOME II PRECURSORV/2.90E-16// 

102aa//43%//P54073 

PLACE50001390 

PLACE60001910 

10 PLACE60004260//CYSTATIN M PRECURSOR (CYSTATIN E).//1 .50E-37//81aa//97%//Q15828 
PLACE60006300 
PLACE60011180 

PLACE60012620//LYSOSOMAL TRAFFICKING REGULATOR (BEIGE HOMOLOG)7/8.40E-14//128aa//38%// 
Q99698 
15 PLACE60017120 

PLACE60052940//TRANSCRIPT REGULATORY PROTEIN GAL11 7/1 .70E-05//445aa//23%//P32257 
PLACE60053280 

P LAC E60054230//D I AP H AN O U S PROTEIN HOMOLOG 27/1 .00E-35//385aa//26%//060879 
P LAC E60054820//HY POTH ETIC AL PROTEIN KIAA00327/1 .20E-50//180aa//61%//Q15034 
20 PLACE60054870//MYOSIN HEAVY CHAIN, NONMUS TYPE B (CELLULAR MYOSIN HEAVY CHAIN, TYPE B) (NMM- 
HC-B)7/3.70E-1 1//434aa//20%//P35580 
PLACE60055350 

PLACE60055460//Homo sapiens leucine-zipper protein FKSG13 (FKSG13) mRNA, complete cds7/5.40E-164// 
327aa//99%//AF31 2393 

25 PLACE60055590//MYOSIN-BINDING PROTEIN H (MYBP-H) (H-PROTEIN) (86 KDA PROTEIN)7/1.30E-05//124aa// 
35%//Q05623 
PLACE60056910 
PLACE60057860 
PLACE60061370 

30 PLACE60062660//ADP-ribosylation factor binding protein GGA1 ; ADP-ribosylation factor binding protein; Golgi-asso- 
ciated, gamma-adaptin ear containing, ARF-binding protein 1 [Homo sapiens] 7/6. 00E-84//249aa//93%//NP_037497 
PLACE60062870 
PLACE60063940 

PLACE600641 80//PUTATIVE PRE-MRNA SPLICING FACTOR ATP-DEPENDENT RNA 
35 HELICASE7/1 .90E-51//368aa//37%//022899 

PLACE60064740//ADHESIVE PLAQUE MATRIX PROTEIN PRECURSOR (FOOT PROTEIN 1) (MCFP1)7/4.80E-11// 
157aa//31%//Q25434 

PLACE60066970//ZINC FINGER PROTEIN 191 7/1 .30E-36//115aa//48%//01 4754 
PLACE6006871 0//SU PPRESSOR PROTEIN SRP40 7/9.50 E-43//238aa//50%//P32583 
40 PLACE60069880 
PLACE60070500 

PLACE60071800//CORONIN-LIKE PROTEIN P577/3.80E-60//108aa//81%//Q92176 

PLACE60072390 

PLACE60072420 

45 PLACE60073090//Homo sapiens myo-inositol 1 -phosphate synthase A1 (ISYNA1 ) mRNA, complete cds7/6.60E-21 9// 
362aa//98%//AF220530 
PLACE60074820 
PLACE60077870 

PLACE60080360//mucin [Homo sapiens]//5.50E-05//164aa//30%//CAA84032 
50 PLACE60081260 
PLACE60082850 

PLACE60087680//INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECURSOR (IGFBP-3) (IBP-3) 
(IGF-BINDING PROTEIN 3)7/2.30E-103//255aa//79%//P 17936 
PLACE60088240 
55 PLACE60092280 
PLACE60092370 
PLACE60093380 
PLACE60095240 
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P LAC E60095600//H Y POTH ETIC AL HELICASE C28H8.3 IN CHROMOSOME HI7/2.70E-28//201aa//38%//Q09475 
PLACE60098350//Human hepatocellular carcinoma associated protein (JCL-1) mRNA, complete cds 7/5.20 E-285// 
558aa//97%//U92544 

PLACE60104630//macrophage migration inhibitory factor (glycosylation-inhibiting factor) [Homo sapiens]//9.70E-51// 
5 110aa//93%//XP_000858 

PLACE601 05680//Homo sapiens mRNA for TU1 2B1 -TY, complete cds7/1 .70E-30//65aa//64%//AB032773 
PLACE60107010//SUPPRESSOR PROTEIN SRP40.//3.80E-05//117aa//29%//P32583 
PLACE601 09910 

PLACE60113340%/BASEMENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE PROTEIN 
10 PRECURSOR (HSPG) (PERLECAN) (PLC).//5.90E-65//238aa//32%//Q05793 

PLACE60118810//Rattus norvegicus kinesin light chain KLCt mRNA, complete cds.//1 .90E-230//504aa//87%// 
AF1 66267 

PLACE60119700//Homo sapiens mRNA for ABP32, complete cds.//2.30E-21//47aa//1007o//AB018357 
PLACE601 20280//SER/THR-RICH PROTEIN T10 IN DGCR REGION.//6.00E-99//126aa//84%//P54797 
15 PLACE60122970//novel C2H2 type zinc finger protein//5.60E-84//169aa//98%//CAC10457 
PLACE60132200//TRICH0HYALIN.//3.10E-47//297aa//47%//P37709 
PLACE601 32320 
PLACE601 32880 

PLACE601 38840//PUTATIVE MITOCHONDRIAL CARRIER PROTEIN PET87/7.40E-59//274aa//47%//P38921 
20 PLACE60140640//Homo sapiens nucleotide binding protein (NBP) mRNA, complete cds.//3.90E-138//262aa//99%// 
AF208536 

PLACE601 5051 0//NU CLEAR PROTEIN SNF77/1 .60E-11//189aa//25%//P39929 

PLACE601 54450//PUTATIVE PREOPTIC REGULATORY FACTOR-2 PRECURSOR (PORF-2).//7.30E-36//75aa// 
98%//P 18890 
25 PLACE601 55910 
PLACE601 57310 
PLACE60162100 
PLACE601 75640 

PLACE60177880//IGSF5 [Homo sapiens].//3.60E-174//315aa//99%//CAB90447 
30 PLACE601 77910 
PLACE60181870 

PLACE601 8441 0//Mus musculus peroxisomal long chain acyl-CoA thioesterase lb (Pte1 b) gene, exon 3 and complete 
cds.//1 .00E-43//126aa//69%//AF1 80801 

PLACE601 84870//PHOSPHOLIPID TRANSFER PROTEIN PRECURSOR (LIPID TRANSFER PROTEIN 11)7/1 .50E- 
35 227//330aa//99%//P55058 

PLACE60188630//Mus musculus mRNA for acetylglucosaminyltransferase-like protein V/7.80E-08//317aa//23%// 
AJ006278 

PROST10001100//Zea mays clone AGPZml arabinogalactan protein (agp) mRNA, partial cds.//5.80E-05//180aa// 
32%//AF1 34579 
40 PROST1 0001 360 
PROST1 00021 50 

PROST20007170//Hypothetical Kruppel-Type Zinc Finger Protein(R28830_1)//0//432aa//100%//AAC24608 
PROST20007600//PUTATIVE ENDONUCLEASE C1F12.06C (EC 3.1 .-.-) 7/5. 90 E-29//1 34aa//44%//Q 10348 
PROST20011160 
45 PROST20011800 
PROST20014140 
PROST20014150 
PROST20014650 

PROST20015210//MYOSIN HEAVY CHAIN, SMOOTH MUSCLE ISOFORM (SMMHC) (FRAGMENT).//3.00E-1 07// 
50 262aa//85%//P35749 
PROST20015400 

PROST20016760//M-PHASE PHOSPHOPROTEIN 8 ( FRAGMENT) 7/9. 00 E-157//298aa//99%//Q99549 
PROST20022120 

PROST20024250//ZINC FINGER PROTEIN 1367/1 .70 E-45//1 28aa//63%//P52737 
55 PROST20028970//Oryctolagus cuniculus CARP mRNA, complete cds.//4.80E-44//177aa//51%//AF131883 

PROST20033240//EPHRIN TYPE-A RECEPTOR 6 PRECURSOR (EC 2.7.1.112) (TYROSINE-PROTEIN KINASE 
RECEPTOR EHK-2) (EPH HOMOLOGY KINASE-2)7/8.40E-241//441 aa//95%//Q62413 

PROST20035170//Homo sapiens zinc finger protein dp mRNA, complete cds.//3.40E-15//128aa//42%//AF 153201 
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PROST20035830 
PROST20036280 

PROST20036350//MULTI FUNCTIONAL AMINOACYL-TRNA SYNTHETASE [INCLUDES: GLUTAMYL-TRNA SYN- 
THETASE (EC 6.1.1.17) (GLUTAMATE--TRNA LIGASE); PROLYL-TRNA SYNTHETASE (EC 6.1.1.15) (PROLINE* 
5 -TRNA LIGASE)]7/2.20E-137//651aa//42%//P07814 

PROST20039300//Bos taurus mitochondrial mRNA for xenobiotic/medium-chain fatty acid:CoA ligase form XL-Ill.// 

1 .60E-68//1 80aa//68%//AJ1 32751 

PROST20041460 

PROST20042700 

10 PROST20045700//Zea mays clone AGPZml arabinogalactan protein (agp) mRNA, partial cds.//5.80E-05//180aa// 
32%//AF1 34579 
PROST20047440 
PROST20048170 

PROST20050390//CYTOCHROME P450 2J2 (EC 1.14.14.1) (CYPIIJ2) (ARACHIDONIC ACID EPOXYGENASE).// 
15 1 .40E-34//1 88aa//42%//P51 589 

PROST20051310//Homo sapiens DEAD-box protein abstrakt (ABS) mRNA, complete cds.//8.50E-134//257aa//99%// 

AF195417 

PROST20052720 

PROST20052850//CYCLIN G-ASSOCIATED KINASE (EC 2.7.1 .-)7/2.20E-18//107aa//54%//P97874 
20 PROST20054660 

PROST20058860//SMALL NUCLEAR RIBONUCLEOPROTEIN ASSOCIATED PROTEINS B AND B' (SNRNP-B) (SM 

PROTEIN B/B') (SM-B/SM-B').//8.20E-05//134aa//33%//P14678 

PROST20060200 

PROST20062820//TRANSLATION INITIATION FACTOR IF-2.//1.50E-07//102aa//41%//P71613 
25 PROST20063430//BCL2/ADENOVIRUS E1B 19-KDA PROTEIN-INTERACTING PROTEIN 2.//8.50E-74//305aa// 
46%//Q 12982 
PROST20065100 

PROST20065790//6-PHOSPHOFRUCTOKINASE, TYPE (EC 2.7.1 .11) (PHOSPHOFRUCTOKINASE 1) (PHOSPHO- 
HEXOKINASE) (PHOSPHOFRUCTO-1 -KINASE ISOZYME C) (6-PHOSPHOFRUCTOKINASE, PLATELET TYPE).// 
30 0//697aa//99%//Q01813 
PROST20073280 

PROST20075280//MYOSIN LIGHT CHAIN KINASE, SMOOTH MUSCLE AND NON-MUSCLE ISOZYMES (EC 
2.7.1.117) (MLCK) [CONTAINS: TELOKIN].//1.30E-08//245aa//23%//P11799 
PROST20078710 
35 PROST20082430 

PROST20084470//Plasmodium berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds.//2.50E-08// 

122aa//28%//AF000413 

PROST20084680 

PROST20084720//CYTOCHROME P450 4F2 (EC 1 .14.13.30) (CYPIVF2) (LEUKOTRIENE-B4 OMEGA- HYDROXY- 
40 LASE) (LEUKOTRI ENE-B4 20-MONOOXYGENASE) (CYTOCHROME P450- LTB-OMEGA).//1 .50E-37//85aa//85%// 
P78329 

PROST20087240 
PROST20093470 
PROST20094000 
« PROST20097310 
PROST20097360 

PROST20097840//SYNAPSIN I.//1 .80E-09//1 93aa//34%//P1 7599 

PROST20099090//ADAM 12 PRECURSOR (EC 3.4.24.-) (A DISINTEGRIN AND METALLO PROTEINASE DOMAIN 
12) (MELTRIN ALPHA) .//1 .90 E-22//73aa//60%//043 184 PROST20102190//CALMODULIN.//1 .30E-20//98aa//47%// 
50 P02594 

PROST201 02500 
PROST201 03820 

PROST20105450//SODIUM/HYDROGEN EXCHANGER 6 (NA(+)/H(+) EXCHANGER 6) (NHE-6) (KIAA0267).// 
1 .60E-96//21 4aa//75%//Q92581 
55 PROST201 06060 

PROST20108850//MICROSOMAL SIGNAL PEPTIDASE 23 KDA SUBUNIT (EC 3.4.-.-) (SPC22/23).//1 .20E-69// 

132aa//100%//PI2280 

PROST20110120 
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PROST20114100 

PROST20120070//KINESIN HEAVY CHAIN (UBIQUITOUS KINESIN HEAVY CHAIN) (UKHC).//5.00E-05//286aa// 

24%//P33176 

PROST20121570 

5 PROST20122490//Gallus gallus syndesmos mRNA, complete cds.//1 .20E-63//139aa//84%//AF095446 
PROST201 24000 
PROST201 25420 

PROST20127450//Homo sapiens TSC-22 related protein (TSC-22R) mRNA, complete cds.//7.90E-44//95aa//98%// 
AF1 53603 
10 PROST201 30320 
PROST201 38730 

PROST20146590//MUCIN 1 PRECURSOR (POLYMORPHIC EPITHELIAL MUCIN) (PEM) (PEMT) (EPISIALIN) (TU- 
MOR-ASSOCIATED MUCIN) (CARCINOMA-ASSOCIATED MUCIN) (TUMOR-ASSOCIATED EPITHELIAL MEM- 
BRANE ANTIGEN) (EMA) (H23AG) (PEANUT- REACTIVE URINARY MUCIN) (PUM) (BREAST CARCINOMA-ASSO- 

15 CIATED ANTIGEN DF3) V/3.50E-08//556aa//23%//P1 5941 

PROST20151370//Human probable zinc finger protein H101 mRNA, partial cds.//1 .60E- 11 //104aa//41%//U81 557 
PROST20152510//UDP-N-ACETYLGLUCOSAMINE-PEPTIDE N-ACETYLGLUCOSAMINYLTRANSFERASE 100 
KDA SUBUNIT (EC 2.4.1 .-) (0-GLCNAC TRANSFERASE P100 SUBUNIT).//6.00E-17//148aa//34%//015294 
PROST20152870//Homo sapiens APC2 gene, exon 14.//1 .60E-05//195aa//309b//AJ131187 

20 PROST20155370//ZINC FINGER PROTEIN 38 (ZFP-38) (CTFIN51) (TRANSCRIPTION FACTOR RU49).//1 .10E-72// 
140aa//55%//Q07231 
PROST201 56360 
PROST201 59320 

PROST20168600//Homo sapiens six transmembrane epithelial antigen of prostate (STEAP1) mRNA, complete cds.// 
25 1 ,20E-70//237aa//54%//AF1 86249 

PUAEN10000650//Homo sapiens TSC-22 related protein (TSC-22 R) mRNA, complete cds.//7.90E-44//95aa//98%// 

AF1 53603 

PUAEN10000870 

PUAEN 10001 640//Mus musculus cerebellar postnatal development protein-1 (Cpd1 ) mRNA, partial cds.//2.80E-126// 
30 270aa//90%//U89345 
PUAEN20000800 

PUAEN20001520//L-A VIRUS GAG PROTEIN N-ACETYLTRANSFERASE (EC 2.3.1 .-)Jh .70E-34//145aa//5196// 
Q03503 

PUAEN20002470//PR MOV-1 0.//6.30E-1 02//405aa//44%//P23249 
35 PUAEN200031 20//ENHANCER OF ZESTE HOMOLOG 2 (ENX-1).//0//643aa//97%//Q15910 SALGL1 0001070// 
CH-TOG PROTEIN (COLONIC AND HEPATIC TUMOR OVER-EXPRESSED PROTEIN) (KIAA0097).//5.30E-150// 
297aa//96%//Q14008 
SKMUS20006790 

S KM US20007260//M I CROTUBULE -ASSOCIATED PROTEIN 1B (MAPI. 2) (MAP1(X)) [CONTAINS: NIAP1 LIGHT 
40 CHAIN LC1 ].//8.1 0E-05//396aa//23%//P14873 

SKMUS20008730//smoothelin large isoform L2 [Homo sapiens]7/1 .20E-221//217aa//98%//AAF01481 
SKMUS20017400//TROPOMYOSIN ALPHA CHAIN, SKELETAL MUSCLE TYPE.//3.20E-97//242aa//81%//P06753 
SKMUS20020770 
SKMUS20026340 

45 SKMUS20040440//60S RIBOSOMAL PROTEIN L3 (HIV-1 TAR RNA BINDING PROTEIN B) (TARBP-B)7/3.60E-189// 
229aa//99%//P39023 
SKMUS20064810 

SKMUS20073150//20-HYDROXYECDYSONE PROTEIN PRECURSOR (20-HE).//1 .50E-05//129aa//32%//P29681 
SKMUS20073590//tropomodulin 4 (muscle) [Homo sapiens]//3.90E-58//115aa//100%//NPJ)37485 
50 SKMUS20079150//splicing factor 3a, subunit 3, 60kD; pre-mRNA splicing factor SF3a (60kD), similar to S. cerevisiae 
PRP9 (spliceosome-associated protein 61) [Homo sapiens]//9.10E-13//65aa//60%//NP_006793 
SKMUS20091900 

SKNMC1 0001 230//CYCLIN-DEPENDENT KINASE 4 INHIBITOR A (CDK4I) (P1 6-INK4) (P1 6-INK4A) (TUMOR SUP- 
PRESSOR CDKN2A).//1 .60E-08//105aa//38%//077617 
55 SKNMC20006350//65 KDA FK506-BINDING PROTEIN PRECURSOR (EC 5.2.1.8) (FKBP65) (FKBPRP) (PEPTI- 
DYL-PROLYL CIS-TRANS ISOMERASE) (PPIASE) (ROTAMASE) (IMMUNOPHILIN FKBP65).//1.10E-185//419aa// 
79%//Q61576 
SKNSH10001010 
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SKNSH20007160 

SKNSH20009710//TROPOMYOSIN, CYTOSKELETAL TYPE (TM30-NM)713.20E-74//174aa//86%//P 12324 
SKNSH20030640//SPLICEOSOME ASSOCIATED PROTEIN 62 (SAP 62) (SF3A66)7/3.00E-06//127aa//33%// 
Q62203 
5 SKNSH20040390 

SKNSH20052400//VEGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1 7/3.60E-1 7//1 75aa//29%//Q00808 
SKNSH20057920//CALCIUM/CALMODULIN-DEPENDENT PROTEIN KINASE TYPE I (EC 2.7.1 .123) (CAM KINASE 
l).//5.00E-37//197aa//40%//Q14012 
SKNSH20068220 
10 SKNSH20094350 

SMINT20000070//Mus musculus mRNA for granuphilin-b, complete cds.//2.70E-44//128aa//41%//AB025259 
S M I NT20002320//S ERIN E/TH REO N I N E PROTEIN PHOSPHATASE 2A, 56 KDA REGULATORY SUBUNIT, BETA 
ISOFORM (PP2A, B SUBUNIT, B' BETA ISOFORM) (PP2A, B SUBUNIT, B56 BETA ISOFORM) (PP2A, B SUBUNIT, 
PR61 BETA ISOFORM) (PP2A, B SUBUNIT, R5 BETA ISOFORM)7/4.70E-160//299aa//100%//Q15173 
15 SMINT20006020//faciogenital dysplasia protein 2 [Mus musculus]//4.10E-158//327aa//87%//AF017368 

SMINT20006090//Oryctolagus cuniculus mRNA for parchorin, complete cds.//7.50E-87//175aa//93%//AB035520 
SMINT20007470//TRICHOHYALIN.//1 .20E-37//492aa//28%//Q07283 

SMINT200081 1 0//CALCIUM -TRANS PORTING ATPASE 2C1 (EC 3.6.1 .38) (ATP-DEPENDENT CA2+ PUMP PMR1).// 
1 .20E-50//1 65aa//63%//P981 94 
20 SMINT20011830 

SMINT20011950//ZINC FINGER PROTEIN 202 7/1 .90 E-67//426aa//40%//095 125 

SMINT20012220 

SMINT20013970 

SMINT20014610 

25 SMINT20016150//FERRITIN LIGHT CHAIN (FERRITIN L SLJBUNIT).//3.50E-91//174aa//100%//P02792 
SMINT20017310 
SMINT20021260 
SMINT20023110 

SMINT20024140//IG KAPPA CHAIN V-IV REGION B17 PRECURSOR7/1 .20E-60//134aa//87%//P06314 
30 SMINT20026200//ENL PROTEIN.//1 .10E-05//260aa//24%//Q03111 

SMINT20028800//tumorsupressor protein - fruit fly (Drosophila melanogaster)7/2.00E-78//493aa//34%//T13797 
SMINT20028840//CMRF35 ANTIGEN PRECURSOR7/8.40E-1 9//136aa//41%//Q 08708 
SMINT20030740//ZINC FINGER PROTEIN 136 7/4.50 E-194//535aa//63%//P52737 
SMINT20031280 

35 SMINT20035050//GTPASE ACTIVATING PROTEIN BEM2/IPL27/1 .20E-07//1 34aa//26%//P39960 

SMINT20035510//Drosophila melanogaster La related protein (larp) mRNA, partial cds7/5.40E-39//334aa//30%// 
AF221108 

SMINT20036440//Drosophila melanogaster epsin-like protein mRNA, complete cds7/2.90E-69//446aa//41%// 
AF233291 

40 SMINT20038660//Homo sapiens HNOEL-iso (HNOEL-iso) mRNA, complete cds7/5.60E-201//368aa//99%//AF201 945 
SMINT20039050//Homo sapiens TRIAD3 mRNA, partial cds.//3.20E-86//156aa//100%//AF228527 
SMINT20043390 

SMINT20044140//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2)7/3.50E-39//440aa//28%//P51523 
SMINT20044730//UBIQUINONE BIOSYNTHESIS PROTEIN AARF7/6.20E-22//272aa//27%//P27854 
45 SMINT20045470 
SMINT20045830 
SMINT20045890 
SMINT20047290 
SMINT20048720 

50 SMINT20049920//CYTOSOLIC PURINE 5'-NUCLEOTIDASE (EC 3.1 .3.5)7/4.1 0E-39//1 32aa//57%//P49902 

SMINT20052130//Rattus norvegicus mRNA for gankyrin homologue, complete cds7/9.20E-07//104aa//33%// 
AB022014 

SMINT20054050//ABC1 PROTEIN HOMOLOG PRECURSOR7/1 .40E-98//467aa//45%//Q92338 
SMINT20056230//lg mu chain precursor, membrane-bound (clone 201) -human//5.60E-233//422aa//78%//S1 4683 
55 SMINT20056240 

SMINT20062050//PLECTIN.//7.50E-17//436aa//25%//P30427 

SMINT20067080 

SMINT20070620 
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SMINT20074330//tektin A1 [Strongylocentrotus purpuratus]//3.10E-26//125aa//45%//M97188 
SMINT20077920 

SMINT20077960//GELSOLIN PRECURSOR, PLASMA (ACTIN-DEPOLYMERIZING FACTOR) (ADF) (BREVIN) 
(AGEL)7/3.30E-246//459aa//99%//P06396 
5 SMINT20081330 

SMINT20083290//IG ALPHA- 1 CHAIN C REGION.//4.40E-196//352aa//99%//P01876 

SMINT20084910 

SMINT20085310 

SMINT20085450 

10 SMINT20086250//GLYCINE CLEAVAGE SYSTEM H PROTEIN PRECURSOR.//2.20E-40//70aa//97%//P23434 
SMINT20086720//ZINC FINGER PROTEIN 191 7/1 .40E-29//109aa//55%//01 4754 

SMINT20088440//IG KAPPA CHAIN V-ll REGION RPMI 6410 PRECURSOR7/5.10E-44//117aa//78%//P06310 

SMINT20088690 

SMINT20089210 

15 SMINT20089600//Homo sapiens mRNA for PICK1 , complete cds./13.10E-145//278aa//100%//AB026491 

SMINT20091190//IG ALPHA- 1 CHAIN C REGION.//3.70E-198//353aa//99%//P01 876 

SMINT20092120 

SMINT20092160 

SMINT20093630 
20 SMINT20094150 

SMINT20094680//Homo sapiens mawbp mRNA for MAWD binding protein, complete cds7/2.80E-50//77aa//1 00%// 
AB049758 
SPLEN20005160 
SPLEN20005370 
25 SPLEN20006950 
SPLEN20011350 
SPLEN20012450 
SPLEN20015030 

SPLEN20015100//HYPOTHETICAL 72.5 KDA PROTEIN C2F7.10 IN CHROMOSOME l7/7.50E-16//121aa//36%// 
30 Q09701 

SPLEN20016500 

SPLEN20017610 

SPLEN20017810 

SPLEN20019120 
35 SPLEN20020530 

SPLEN20023430 

SPLEN20023540//H .sapiens mRNA for F25B3.3 kinase like protein from C. elegans.//1 .50E-205//385aa//99%// 
Y12336 

SPLEN20023850//DNA REPAIR PROTEIN RAD187/3.00E-56//469aa//30%//P53692 
40 SPLEN200241 90//EGF-containing fibulin-like extracellular matrix protein 1 ; fibrillin-like [Homo sapiens]//3.70E-192// 
327aa//99%//NP_061 489 
SPLEN20024510 

SPLEN20024620//Homo sapiens mRNA for acetyl LDL receptor, complete cds.//1 .00E-217//401aa//100%//D86864 
SPLEN20024770//Rattus norvegicus (rsec6) mRNA, complete cds.//2.10E-88//545aa//31%//U32575 
45 SPLEN20024930//Rattus norvegicus PIPP mR.NA for proline-rich inositol polyphosphate 5-phosphatase, complete 
cds7/0//639aa//91 %//AB032551 
SPLEN20029170 
SPLEN20036780 

SPLEN200391 80//TENSIN.//2.70E-1 35//341 aa//653o//Q04205 
50 SPLEN20040780//C B (SMALL PROLINE-RICH PROTEIN 1B) (SPR1B) (SPR1 
B)7/5. 1 0E-1 2//1 1 0aa//37%//Q62267 

SPLEN20041810//BC-2 protein [Homo sapiens]//4.00E-24//59aa//96%//AF042384 
SPLEN20042200//TRANSCRIPTIONAL REPRESSOR CTCF.//8.40E-22//1 27aa//33%//P4971 1 
SPLEN20043430 
55 SPLEN20043460 

SPLEN20043680//DNA-REPAIR PROTEIN COMPLEMENTING XP-D CELLS (XERODERMA PIGMENTOSUM 
GROUP D COMPLEMENTING PROTEIN) (DNA EXCISION REPAIR PROTEIN ERCC-2).//7.30E-171//325aa//99%// 
P 18074 
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SPLEN20045550 

SPLEN20048800//Homo sapiens mRNA for N-Acetylglucosamine kinase.//1 .90E-51//1 04aa//100%//AJ242910 
SPLEN20049840//M.musculus mRNA for myosin l.//0//1093aa//89%//X97650 
SPLEN20050090//TRICHOHYALIN 7/7.20E-1 7//554aa//23%//P37709 

5 SPLEN20051420 

SPLEN20054160//Dof protein [Drosophtla melanogaster]//9.60E-14//222aa//29%//AJ0 10641 
SPLEN20054500//Homo sapiens mRNA for putative dipeptidase.//7.10E-130//244aa//100%//AJ295149 
SPLEN20055600//ZINC FINGER PROTEIN 46 (ZINC FINGER PROTEIN KUP).//3.00E-56//155aa//63%//P24278 
SPLEN20057830//REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1).//1 .10E-16//139aa//43%//P49646 

10 SPLEN20057900//Homo sapiens N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase mRNA, 
complete cds7/9.00E-75//1 30aa//93%//AF1 87072 
SPLEN20058180 

SPLEN20059270//ZINC-FINGER PROTEIN RFP (RET FINGER PROTEIN).//7.20E-61//497aa//31%//P14373 
SPLEN20062830 

15 SPLEN20063250//zinc finger protein nocA - fruit fly (Drosophila melanogaster)//9.00E-1 6//364aa//30%//A55929 

SPLEN20063890//FIBROMODULIN PRECURSOR (FM) (COLLAGEN-BINDING 59 KDA PROTEIN).//2.60E-118// 

242aa//94%//Q06828 

SPLEN20067010 

SPLEN20071820//Homo sapiens DNA polymerase mu (Pol mu) mRNA, complete cds.//4.70E-62//116aa//100%// 
20 AF1 76097 

SPLEN20073500//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 2 (B94 PR0TEIN).//3.50E-10//463aa// 

25%//Q03169 

SPLEN20073880 

SPLEN20076190 

25 SPLEN20076470//KINESIN LIGHT CHAIN (KLC).//2.40E-1 8//1 37aa//38%//P46825 

SPLEN20080070//TISSUE ALPHA-L-FUCOSIDASE PRECURSOR (EC 3.2.1.51) (ALPHA-L-FUCOSIDA I) (AL- 

PHA-L-FUCOSIDEFUCOHYDROLASE).//4.70E-253//359aa//98%//P04066 

SPLEN20081640 

SPLEN20085910//Homo sapiens protein activator of the interferon-induced protein kinase (PACT) mRNA, complete 
30 cds.//2.90E-23//52aa//1 00%//AF072860 
SPLEN20087370 
SPLEN20087860 

SPLEN20090880//HLA CLASS I HISTOCOMPATIBILITY ANTIGEN, A-24(A-9) ALPHA CHAIN PRECURSOR (AW- 
24)7/1 .40E-66//1 53aa//83%//P05534 
35 SPLEN20098030//TRANSCRIPTION INTERMEDIARY FACTOR 1-BETA (NUCLEAR COREPRESSOR KAP-1) 
(KRAB-ASSOCIATED PROTEIN 1)7/4.00E-18//267aa//26%//Q 13263 

SPLEN20100040//258.1 1 KDA PROTEIN C210RF5 (KIAA0933).//7.90E-46//223aa//43%//Q9Y3R5 
SPLEN20101950//SODIUM/HYDROGEN EXCHANGER 6 (NA(+)/H(+) EXCHANGER 6) (NHE-6) (KIAA0267).//7.70E- 
1 1 2//353aa//61 %//Q92581 
40 SPLEN20104150 
SPLEN201 04690 
SPLEN20105100 

SPLEN20108000//Homo sapiens steroid dehydrogenase homolog mRNA, complete cds.//9.30E-73//155aa//94%// 
AF078850 
45 SPLEN201 08460 

SPLEN20110180//Homo sapiens golgi membrane protein GP73 mRNA, complete cds.//1.60E-162//347aa//91%// 
AF236056 
SPLEN201 10210 
SPLEN201 10860 
50 SPLEN20111450 
SPLEN20114190 

SPLEN2011 6720//Homo sapiens misato mRNA, partial cds7/1 .00E-277//481aa//89%//AF272833 
SPLEN20117580 

SPLEN20118050//BONE/CARTILAGE PROTEOGLYCAN I PRECURSOR (BIGLYCAN) (PG-S1 ).//1 .00E-123//240aa// 
55 9796//P21810 

SPLEN20121790 
SPLEN201 25230 
SPLEN20126110 
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SPLEN20135030//HomosapiensPDZ-LIM protein mystique m RN A, complete cds.//3.30E-92//178aa//97%//AY007729 
SPLEN20136700 

SPLEN20136730//Homo sapiens RAB-like protein 2A (RABL2A) mRNA, complete cds.//9.30E-41//102aa//90%// 
AF095350 
5 SPLEN20137530 

SPLEN20138600//NUMB protein [Homo sapiens].//1.00E-97//102aa//100V/AAD01548 

SPLEN20139100//Human (hybridoma H210) anti-hepatitis A IgG variable region, constant region, complementarity- 
determining regions mRNA, complete cds.//2. 30 E-227//490aa//85%//M 87789 

SPLEN20139360//MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN B) (CENP-B).//5.10E-35// 
10 295aa//32%//P07199 
SPLEN201 75920 

SPLEN20176130//Homo sapiens mRNA for ALEX1 , complete cds.//9.90E-21//161aa/132%//AB039670 
SPLEN201 77400 

SPLEN201 80980//METHIONYL-TRNA SYNTHETASE (EC 6.1.1.10) (METHIONINE-TRNA LIGASE) (METRS).// 
15 3.40E-57//1 33aa//87%//P561 92 

SPLEN20181570//TRICHOHYALINy/7.80E-45//832aa//23%//P37709 
SPLEN201 82850 

SPLEN20182990//Mus musculus actin-binding protein (ENC-1) mRNA, complete cds.//1 .90E-49//586aa//26%// 
U65079 

20 SPLEN201 83020//BASEMENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE PROTEIN 
PRECURSOR (HSPG) (PERLECAN) (PLC).//1 .20E-24//128aa//46%//Q05793 
SPLEN201 83950 

SPLEN20187490//RESTIN (CYTOPLASMIC LINKER PROTEIN-1 70 ALPHA-2) (CLIP-170) (REED- STERNBERG IN- 
TERMEDIATE FILAMENT ASSOCIATED PROTEIN)y/5.10E-09//411aa//22%//P30622 
25 SPLEN201 90080 

SPLEN201 90430//NEURALI2ED PROTEIN.//2.80E-09//181aa//27%//P29503 
SPLEN201 90770 

SPLEN20191020//Homo sapiens MIST mRNA, partial cds.//1 .50E-207//376aa//99%//AB032369 
SPLEN201 92570 
30 SPLEN201 93230 
SPLEN201 93490 
SPLEN201 93750 

SPLEN20193790//INTERFE RON-REGULATED RESISTANCE GTP-BINDING PROTEIN MXA 
(INTERFERON- INDUCED PROTEIN P78) (IFI-78K).//0//572aa//98%//P20591 
35 SPLEN20195710//KINESIN LIGHT CHAIN (KLC).//5.80E-28//145aa//45%//P46824 
SPLEN201 97090 
SPLEN201 97740 

SPLEN20197930//Rattus norvegicus putative transcription factor LUZP (Luzp) mRNA, complete cdsV/6.10E-124// 
275aa//90%//AF181259 
40 SPLEN20198390//TIPD PROTEIN.//2.80E-52//307aa//37%//015736 
SPLEN201 99850 
SPLEN20200070 
SPLEN20200340 

S PLEN2020 1 830//BO N E/CARTI LAG E PROTEOGLYCAN I PRECURSOR (BIGLYCAN) (PG-S1 ).//2.50E-1 52//283aa// 
45 1 00%//P21810 

SPLEN20203590 

SPLEN20204670 

SPLEN20205120 

TESOP1 0000350 
50 TESOP1 0001 600 

TESTI10000190//MUCIN 1 PRECURSOR (POLYMORPHIC EPITHELIAL MUCIN) (PEM) (PEMT) (EPISIALIN) (TU- 
MOR-ASSOCIATED MUCIN) (CARCINOMA-ASSOCIATED MUCIN) (TUMOR-ASSOCIATED EPITHELIAL MEM- 
BRANE ANTIGEN) (EMA) (H23AG) (PEANUT- REACTIVE URINARY MUCIN) (PUM) (BREAST CARCINOMA-ASSO- 
CIATED ANTIGEN DF3).//1.40E-23//667aa//28%//P15941 
55 TEST1 10000850 

TESTI10001570//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//1 .30E-148//598aa//4696//P51523 

TESTI2000431 0//TRICHOHYALIN.//1 .40E-07//247aa//25%//P22793 

TESTI20005980 
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TESTI20006160//CALCIUM-BINDING P ROTE IN.//1 .1 0E-11 //260aa//28%//P35085 
TESTI20006830//GAR2 PROTEIN.//2.90E-05//221 aa//1 9%//P41 891 

TESTI20012080/mJBULIN-TYROSINE LIGASE (EC 6.3.2.25) (TTL).//2.20E-22//217aa//35%//P38584 
TESTI20012360 
5 TESTI20016970 
TESTI20019590 
TESTI20028020 

TESTI20029100//FIBROSIN (FRAGMENT).//1 .60E-10//70aa//48%//Q60791 

TESTI20030200//DOUBLE-STRANDED RNA-SPECIFIC EDITASE 1 (EC 3.5.-.-) (DSRNA ADENOSINE DEAMI- 
10 NASE) (RNA EDITING ENZYME 1)y/1.90E-30//192aa//38%//P51400 
TESTI20030440//TRICHOHYALIN.//2.80E-21//412aa//26%//P37709 
TESTI20030610 

TESTI20031310//ALPHA-1-ANTICHYMOTRYPSIN PRECURSOR (ACT).//7.80E-222//423aa//99%//P01011 
TESTI20031 41 0//SYNAPTONEMAL COMPLEX PROTEIN 1 (SCP-1 PROTEIN).//6.10E-11//449aa//24%//Q03410 
15 TESTI20032770//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG PRECURSOR.//1.40E-07//121aa//37%// 
P40602 

TESTI20034750 
TESTI20035330 

TESTI20035790//SPLICING FACTOR, ARGININE/SERINE-RICH 4 (PRE-MRNA SPLICING FACTOR SRP75).// 
20 2.30E-44//346aa//37%//Q081 70 

TESTI20038240//Mus musculus neprilysin-like metal lope ptidase 1 (N11) mRNA, complete cds.//0//770aa//77%// 

AF1 76569 

TESTI20040850 

TESTI20041 630//Mus musculus mRNA for type II cytokeratin, complete cds.//1 .30E-1 51//407aa//75%//AB033744 
25 TESTI20043130 

TESTI200431 80//mouse mRNA for megakaryocyte potentiating factor, complete cds.//8.40E-06//392aa//24%//D86370 

TESTI20043220//ORM1 PROTEIN.//6.40E-21//1 38aa//37%//P53224 

TESTI20043910 

TESTI20043990//APOMUCIN (MUCIN CORE PROTEIN) (FRAGMENT).//8.70E-64//283aa//53%//P12021 
30 TESTI20044900//Strongylocentrotus purpuratus radial spokehead mRNA, complete cds.//2.50E-150//447aa//61%// 
U73123 

TESTI20045390//Homo sapiens adlican mRNA, complete cds.//1 .80E-173//632aa//47%//AF245505 

TESTI20045740 

TESTI20046110 

35 TESTI20046490//Homo sapiens B2 gene partial cDNA, clone B2E.//4.30E-33//284aa//34%//AJ002220 
TESTI20046540 

TESTI20046870//RETINAL-BINDING PROTEIN (RALBP).//4.00E-06//244aa//23%//P491 93 
TESTI20046890//Mus musculus axotrophin mRNA, complete cdsy/2.30E-40//173aa//53%//AF1 55739 
TESTI20047370//ATROPHIN-1 (DENTATORUBRAL-PALLIDOLUYSIAN ATROPHY PROTEIN).//1 .90E-22//695aa// 
40 28%//P54258 

TESTI20047930//Homo sapiens NY-REN-2 antigen mRNA, complete cds.//1 .50E-1 91 //530aa//67%//AF1 55095 
TESTI20049060//H .sapiens mRNA for SIRP-beta1 V/2.50E-31//1 72 aa//46%//Y 10376 
TESTI20049410 
TESTI20049990 
45 TESTI20050170 
TESTI20050400 

TESTI20050720//SUCCINYL-COA:3-KETOACID-COENZYME A TRANSFERASE PRECURSOR (EC 2. 8. 3. 5) (SUC- 
CINYL COA:3-OXOACID COA-TRANSFERASE) (OXCT).//1 .40E-208//519aa//74%//P55809 

TESTI20051200//POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (EC 2.4.1.41) (PROTEIN- UDP 
50 ACETYLGALACTOSAMINYLTRANSFERASE) (UDP-GALNAC:POLYPEPTIDE, N-ACETYLGALACTOSAMINYL- 
TRANSFERASE) (GALNAC-T1 ).//6.1 0E-50//1 89aa//49%//Q1 0472 
TESTI20051 730//MALTOSE PERMEASE.//2.20E-05//327aa//23%//Q45632 
TESTI20052670 

TESTI20053070//H YPOTH ETICAL WD-REPEAT PROTEIN SLL01637/5.00E-06//172aa//28%//Q55563 
55 TESTI20053260 
TESTI20053780 
TESTI20053800 
TESTI20053950 
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TESTI20054700//Streptococcus pneumoniae strain g375 surface protein PspC (pspC) gene, pspC-8. 1 allele, complete 
cds.//6.70E-1 1//21 9aa//32%//AF1 5401 5 

TESTI20055680//TRANSLATION INITIATION FACTOR IF-2.//5.90E-08//98aa//41%//060841 
TESTI20055880 

5 TESTI20056030//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds7/3.60E-103//316aa//6896// 
AF204231 
TESTI20057200 

TESTI20057430//ZINC FINGER PROTEIN 8 (ZINC FINGER PROTEIN HF.18) (FRAGMENT).//3.00E-307//543aa// 
99%//P 17098 

10 TESTI20057590//TYROSINE-PROTEIN KINASE-LIKE 7 PRECURSOR (COLON CARCINOMA KINASE-4) (CCK-4).// 
7.80E-07//1 52aa//27%//Q1 3308 

TESTI20057840//INHIBITOR OF APOPTOSIS PROTEIN (IAP) (INHIBITOR OF T CELL APOPTOSIS PROTEIN).// 
7.20E-08//86aa//36%//Q90660 

TESTI20057880//CALDESMON (CDM)7/6.20E-08//203aa//29%//P1 2957 
15 TESTI 20058350//S ERIN E/TH R EON I N E PROTEIN PHOSPHATASE 2A, 55 KDA REGULATORY SUBUNIT B, GAM- 
MA ISOFORM (PP2A, SUBUNIT B, B-GAMMA ISOFORM) (PP2A, SUBUNIT B, B55-GAMMA ISOFORM) (PP2A, 
SUBUNIT B, PR55-GAMMA ISOFORM) (PP2A, SUBUNIT B, R2-GAMMA ISOFORM)7/2.00E-232//426aa//99%// 
P50410 

TESTI20058920//TUBULIN ALPHA-3/ALPHA-7 CHAIN7/3.50E-148//277aa//98%//P05214 
20 TESTI20059080//Homo sapiens hyaluronidase (LUCA-3) mRNA, complete cds.//2.40E-1 70//298aa//1 00%//AF04071 0 
TESTI20059330 
TESTI20059370 
TESTI20059480 
TESTI20059790 

25 TESTI20059810//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//7.50E-153//536aa//52%//P51523 
TESTI20060080//MYOSIN HEAVY CHAIN, CLONE 203 (FRAGMENT) 7/5.30 E-09//236aa//24%//P39922 
TESTI20060150 
TESTI20060350 
TESTI20060450 

30 TESTI20060830//Mus musculus mRNA for MIWI (piwi), complete cds7/0//824aa//94%//AB032604 
TESTI20061090 

TESTI20061200//NITRITE EXTRUSION PROTEIN (NITRITE FACILITATOR)7/4.90E-05//379aa//23%//P46907 

TESTI20062120//poly(A)-specific ribonuclease (deadenylation nuclease) ; deadenylation nuclease [Homo sapiens]// 

1 .00E-38//1 44aa//36%//NP_002573 
35 TESTI20062180 

TEST120062580 

TESTI20063330 

TESTI20063410 

TESTI20063600 
40 TESTI20064370 

TESTI20064530//microtubule-associated protein like echinoderm EMAP [Homo sapiens]7/1 .00E-173//562aa//48%// 
XP_009139 

TESTI20064650//Rattus norvegicus myr 6 myosin heavy, chain mRNA, complete cds.//0//645aa//91%//U60416 
TESTI20064990 

45 TESTI20065650//INNER CENTROMERE PROTEIN (INCENP)7/1 .00E-14//273aa//27%//P53352 
TESTI20066150 
TESTI20066170 
TESTI20066280 

TESTI20066330//FIBRONECTINPRECURSOR7/9.10E-12//408aa//22%//Q91740 
50 TESTI20066590 

TESTI20066650//CELL DIVISION CONTROL PROTEIN 25.//2.20E-18//216aa//32%//P04821 
TESTI20067350 

TESTI20067440//INTRACELLULAR PROTEIN TRANSPORT PROTEIN US017/2.90E-11//553aa//20%//P25386 
TESTI20067480//ZINC FINGER PROTEIN 184 (FRAGMENT) 7/8.60 E-134//421aa//50%//Q99676 
55 TESTI20068530 

TESTI20068790//TR ANS-ACTI NG TRANSCRIPTIONAL PROTEIN ICPO (P135 PROTEIN) (IER 2. 9/ER2.6)7/7.50E- 

0e//240aa//28%//P291 28 

TESTI20068940 
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TESTI20070400//Homo sapiens CTL2 gene.//5.90E-229//694aa//55%//AJ245621 
TESTI20070740 

TESTI20071130//MYOSIN HEAVY CHAIN, CARDIAC MUSCLE ALPHA ISO FO RM 7/6.30 E-09//399aa//23%//Q02566 
TESTI20071630//GLIOMA PATHOGENESIS-RELATED PROTEIN (RTVP-1 PROTEIN).//5.10E-44//203aa//42%// 
5 P48060 

TESTI20073460 

TESTI200752401 /HYPOTHETICAL ZINC FINGER PROTEIN KIAA0961 .//3.20E-145//492aa//56%//Q9Y2G7 
TESTI20076570//Homo sapiens mitogen-activated protein kinase phosphatase x (MKPX) mRNA, complete cds.// 
7.20E-66//1 26aa//1 00%//AF1 6551 9 
10 TESTI20076920 
TESTI20079060 

TESTI20079220//ZINC FINGER PROTEIN 29 (ZFP-29).//2.00E-73//281aa//49%//Q07230 

TESTI20079980//SEGMENT POLARITY PROTEIN DISHEVELLED HOMOLOG DVL-1 (DISHEVELLED-1) (DSH HO- 
MOLOG 1 )7/7. 1 0E-21 2//41 3aa//93%//P51 141 
15 TESTI20080460 

TESTI20081890//SPA-1 like protein p1294 [Rattus norvegicus)//5.80E-129//385aa//41%//AF026504 
TESTI20083890 

TESTI20084250//OXYSTEROL-BINDING PROTEIN7/3.70E-183//561 aa//62%//P1 6258 
TESTI20085670 

20 TESTI20086840//CARTIUVGE MATRIX PROTEIN PRECURSOR (MATRILIN-1).//3.40E-09//181aa//29%//P05099 

TESTI20088840//ZINC-FINGER PROTEIN RFP (RET FINGER PROTEIN)7/2.60E-73//258aa//31%//Q62158 

TESTI20089290 

TESTI20090180 

TESTI20090970 
25 TESTI20091360 

TESTI20092170 

TESTI20093900 

TESTI20094620 

TESTI20095200//HYPOTHETICAL 98.3 KDA PROTEIN B0495.7 IN CHROMOSOME 117/1 .30 E-63//328aa//40%// 
30 Q09216 

TESTI20095440//probable membrane protein YOR240w - yeast (Saccharomyces cerevisiae)//1 .00E-25//98aa//55%// 
S67133 

TESTI20095770//NEDD1 PROTEIN (FRAGMENT).//5.80E-297//648aa//85%//P3321 5 
TESTI20095880//HYPOTHETICAL SYMPORTER SLL1 3747/3.80E-26//243aa//27%//P741 68 
35 TESTI20097270 

TESTI20099350//MYOSIN HEAVY CHAIN, NONMUSCLE TYPE B (CELLULAR MYOSIN HEAVY CHAIN, TYPE B) 
( N M M HC-B) 7/1 . 50 E-26//566aa//23%//P35580 

TESTI20100090//Homo sapiens endocytic receptor Endo180 (ENDO180) mRNA, complete cds.//5.00E-161//317aa// 
93%//AF1 34838 
40 TESTI201 02390 

. TESTI20103690//Columba livia mRNA for 5 , -nucleotidase.//2.70E-114//324aa//66%//AJ131243 

TESTI201 04090//TRANSCRI PTIONAL ENHANCER FACTOR TEF-4 (EMBRYONIC TEA DOMAIN-CONTAINING 
FACTOR) (ETF) (ETEF-1) (TEAD-2).//1 .60E-228//450aa//92%//P48301 
TESTI201 051 30//MYOTUBULARIN 7/4.30 E-95//537aa//38%//Q1 3496 
45 TESTI20105910//AMILORIDE-SENSITIVE SODIUM CHANNEL DELTA-SUBUNIT (EPITHELIAL NA+ CHANNEL 
DELTA SUBUNIT) (DELTA ENAC) (NONVOLTAGE-GATED SODIUM CHANNEL 1 DELTA SUBUNIT) (SCNED) (DEL- 
TA NACH)7/0//636aa//97%//P51172 

TESTI20106170//Mus musculus spermatid WD-repeat protein mRNA, complete cds.//2.20E-167//367aa//83%// 
AF274321 

50 TESTI20106820//PROTEIN KINASE C, ETA TYPE (EC 2.7.1.-) (NPKC-ETA) (PKC-L)7/5.40E-53//97aa//1 00%// 
P24723 

TESTI20107240//EUKARYOTIC TRANSLATION INITIATION FACTOR 3 SUBUNIT 10 (El F-3TH ETA) (EUKARYOTIC 
TRANSLATION INITIATION FACTOR 3 LARGE SUBUNIT) (PNLA-35)7/5.60E-07//428aa//22%//Q40554 
TESTI20107320//G1/S-SPECIFIC CYCLIN C-TYPE.//1 .20E-05//130aa//26%//P93411 
55 TESTI201 07340 

TESTI201 08060//SERINE/THREONINE PROTEIN PHOSPHATASE PP1 -BETA CATALYTIC SUBUNIT (EC 3.1 .3.1 6) 
(PP-1 B)7/1 . 1 0E-78//1 45aa//1 00%//P371 40 

TESTI20112540//CALDESMON(CDM)7/9.10E-06//203aa//30%//P12957 
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TESTI20112860//MYOSIN LIGHT CHAIN KINASE (EC 2.7.1.117) (MLCK).//1.60E-54//290aa//40%//P25323 

TESTI20113150 

TESTI201 13940 

TESTI20114480//Human 1(3)mbt protein homolog mRNA, complete cds7/7.10E-146//582aa//49%//U89358 
5 TESTI201 16050 

TESTI20116120//Aegilops squarrosa partial GAG56D gene for gamma-gliadin, accession Clae 247/3. 10 E-07//93aa// 
40%//AJ389681 
TESTI201 17500 
TESTI201 18460 
10 TESTI201 20500 
TESTI201 20900 

TESTI201 21 040//PROBABLE PROTEIN PHOSPHATASE 2CT23F1 1 .1 (EC3.1 .3.16) (PP2C).//8.60E-13//91aa//39%// 
P49596 

TESTI20121 71 0//HYPOTHETICAL 57.5 KDA PROTEIN IN VMA7-RPS25A INTERGENIC REGION.//5.00E-08// 
15 292aa//26%//P53214 

TESTI20122070//Xenopus laevis ER1 mRNA, complete cds.//1 .80E-78//341aa//49%//AF015454 

TESTI20122440 

TESTI201 24440 

TESTI20125280//M-PROTEIN, STRIATED MUSCLEV/3.60E-68//295aa//46%//Q021 73 
20 TESTI20125440 

TESTI20125920//G PROTEIN PATHWAY SUPPRESSOR 1 (GPS1 PROTEIN) (MFH PROTEIN).//5.20E-199//367aa// 
98%//Q 13098 

TESTI20126280//Mus musculus STAP mRNA for sperm tail associated protein, complete cds.//4.60E-213//769aa// 
57%//AB029919 

25 TESTI20130530//INSULIN-DEGRADING ENZYME (EC 3.4.24.56) (INSULYSIN) (INSULINASE) (INSULIN PRO- 
TEASE).//! ,60E-237//464aa//94%//P1 4735 

TESTI201 31 440//CARBOXY PEPTIDASE A1 PRECURSOR (EC 3.4.1 7.1 ).//2.80E-1 07//332aa/%58%//P1 5085 
TESTI201 32310 
TESTI201 32680 
30 TESTI201 34010 
TESTI201 34270 

TESTI20134680//CENTROMERIC PROTEIN E (CENP-E PROTEIN).//2.40E-08//796aa//19%//Q02224 
TESTI20134970//M. musculus Tenr mRNA for RNA binding protein.//5.50E-265//559aa//88%//X84693 
TESTI20136010/IMYOSIN HEAVY CHAIN, NONMUSCLE (CELLULAR MYOSIN HEAVY CHAIN) (NMMHC).//1 .10E- 
35 1 1 //438aa//23%//P 14105 

TESTI20140970/A/_segment translation product [Homo sapiens] .//6.30E-51//1 01 aa//99%//AAC80210 
TESTI201 42480 

TESTI20142540/IMPV17 PROTEIN.//8.00E-62//116aa//98%//P39210 

TESTI20143180//CENTROMERIC PROTEIN E (CENP-E PROTEIN).//3.30E-09//507aa//22%//Q02224 
40 TESTI20144390//TESTIS-SPECIFIC PROTEIN PBS13.//6.40E-76//251aa//63%//Q01755 

TESTI20145780//Mus musculus mRNA for SH2-containing leukocyte protein 657/1 .20E-13//91aa//36%//Y17159 
TESTI20148380//TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 3521 .//2.50E-05//193aa//2296//P31 948 
TESTI201 49880 

TESTI20150420//RHO-GTPASE-ACTIVATING PROTEIN 1 (GTPASE-ACTIVATING PROTEIN RHOGAP) (RHO-RE- 
45 LATED SMALL GTPASE PROTEIN ACTIVATOR) (CDC42 GTPASE-ACTIVATING PROTEIN) (P50-RHOGAP).// 
4.50E-09//1 29aa//34%//Q07960 

TESTI20150920//PROTEIN PHOSPHATASES PP1 REGULATORY SUBUNIT SDS22.//2.60E-16//193aa//34%// 
P22194 

TESTI20151050 
50 TESTI20151800 
TESTI201 52490 

TESTI201 5331 0//LAM IN B3.//6.30E-1 3//1 04aa//41 %//P48680 

TESTI201 54370//HYALU RON AN MEDIATED MOTILITY RECEPTOR (INTRACELLULAR HYALURONIC ACID BIND- 
ING PROTEIN) (RECEPTOR FOR HYALURONAN-MEDIATED MOTILITY).//1.10E-07//300aa//21%//075330 
55 TESTI201 59380 
TESTI20161010 

TESTI20162780//COTE1 PROTEIN.//1 .00E-107//207aa//99%//P81408 

TESTI201 62980//DNA-DAMAGE INDUCIBLE PROTEIN DDI1 .//1 .80E-39//1 74aa//48%//P40087 



211 



EP 1 308 459 A2 



TESTI20164210//PROTEIN KINASE C-BINDING PROTEIN NELL1 (NEL-LIKE PROTEIN 1) (FRAGMENT).//1.60E- 
88//1 63aa//92%//Q92832 
TESTI201 65680 
TESTI201 65990 

5 TESTI20166290//Homo sapiens NY-REN-50 antigen mRNA, partial cds.//3.00E-223//426aa//93%//AF155112 
TESTI20166670//HOMEOBOX PROTEIN HOX-B1 (HOX-2l).//6.70E-06//224aa//29%//P14653 
TESTI201 67580 

TESTI20168880//BREAKPOINT CLUSTER REGION PROTEIN (EC 2.7.1 .-).//2.10E-23//57aa//92%//P11274 
TESTI20169500//HYPOTHETICAL51.9 KDA PROTEIN C27F1.04C IN CHROMOSOME l.//9.50E-17//428aa//25%// 
10 Q10173 

TESTI20170170//Homo sapiens mRNA for chondroitin-4-sulfotransferase (C4ST gene).//1.10E-53//277aa//40%// 
AJ269537 
TESTI201 70280 
TESTI201 70690 
15 TESTI201 70890 

TESTI20171070//CYCLIC-AMP-DEPENDENT TRANSCRIPTION FACTOR ATF-4 (DNA-BINDING PROTEIN 
TAXREB67) (CYCLIC AMP RESPONSE ELEMENT-BINDING PROTEIN 2) (CREB2).//2.60E-63//136aa//91%// 
P 18848 

TEST1201 73050 
20 TESTI20173110 

TESTI20173960//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//1 .90E-104//335aa//48%// 
Q05481 

TESTI20175370//MYOSIN HEAVY CHAIN, NONMUSCLE (CELLULAR MYOSIN HEAVY CHAIN) (NMMHC).//1 .00E- 
11//290aa//25%//P14105 

25 TESTI201 76450//thioredoxin interacting factor [Mus musculus].//1 .00E-75//300aa//46%//AAG32665 
TESTI201 79230 
TESTI201 79510 

TESTI20180600//Homo sapiens HOM-TES-85 tumor antigen mRNA, complete cds.//8.10E-106//202aa//99%// 
AF1 24430 
30 TESTI201 82210 

TESTI20182760//AMILORIDE-SENSITIVE SODIUM CHANNEL DELTA-SUBUNIT (EPITHELIAL NA+ CHANNEL 
DELTA SUBUNIT) (DELTA ENAC) (NONVOLTAGE-GATED SODIUM CHANNEL 1 DELTA SUBUNIT) (SCNED) (DEL- 
TA NACH).//5.90E-1 85//336aa//99%//P51 1 72 

TESTI20183680//EARLY NODULIN 20 PRECURSOR (N-20).//5.10E-08//127aa//37%//P93329 
35 TESTI201 84280 

TESTI201847501/LAMININ ALPHA- 1 CHAIN PRECURSOR (LAMININ A CHAIN) 7/2. 30 E-200//377aa//99%//P25391 
TESTI20184760//ZINC FINGER PROTEIN 29 (ZFP-29).//9.70E-74//281aa//49%//Q07230 
TESTI201 84820 
TESTI20186110 
40 TESTI201 92570 

TESTI201 93080//GAR22 PROTEIN.//5.1 0E-1 53//31 3aa//92%//Q99501 
TESTI201 93520 

TESTI20194880//PERIOD CIRCADIAN PROTEIN 1 (CIRCADIAN PACEMAKER PROTEIN RIGUI) (HPER).//1 .50E- 
1 1//288aa//30%//015534 

45 TESTI20196690//Bos taurus pyruvate dehydrogenase phosphatase regulatory subunit precursor, mRNA, complete 
cds.//9. 1 0E-1 1 4//224aa//92%//AF026954 

TESTI20196970//THIMET OLIGOPEPTIDASE (EC 3.4.24.15) (ENDOPEPTIDASE 24.15) (MP78).//4.00E-106// 
147aa//100%//P52888 
TESTI201 97030 
50 TESTI201 97290 

TESTI201 97600//TRANSCRIPTIONAL REPRESSOR CTCF.//5.30E-1 20//271 aa//82%//P4971 1 
TESTI201 98540 
TESTI201 98600 

TESTI20199110//disintegrin-like testicular metalloproteinase (EC 3.4.24.-) IVb - crab-eating macaque (fragment) // 
55 1.00E-167//331aa//84%//!65253 
TESTI201 99980 
TESTI20200120 
TESTI20200840 
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TESTI20201760//SYNAPTONEMAL COMPLEX PROTEIN 1 (SCP-1 PROTEIN).//4.70E-09//1 B9aa//25%//Q 15431 

TESTI20202830 

TESTI20204260 

TESTI202051 00//TRICHOHYALIN.//2.60E-1 5//343aa//24%//P37709 
5 TESTI20205150 

TESTI20205250//phosphatidylinositol-4-phosphate 5-kinase homolog T3K9.2 - Arabidopsis thaliana //4.20E-21// 
194aa//33%//T02098 

TESTI20207170//Human testis-specific protein (TSPY) mRNA, complete cdsV/6.60E-111//231aa//96%//U58096 
TESTI20209050//HYPOTHETICAL 113.1 KDA PROTEIN IN PRES-FET4 INTERGENIC REGION.//1 .90E-05//462aa// 
10 22%//Q04893 
TESTI20210030 

TESTI20210570//RETINAL-B1NDING PROTEIN (RALBP).//5.00E-53//327aa//36%//P49193 
TESTI20211380 

TESTI20212970//PUTATIVE ATP-DEPENDENT RNA HELICASE YIR002C.//5.90E-85//458aa//32%//P40562 
15 TEST1 202 14630 

TESTI20215310//Homo sapiens calcyclin binding protein mRNA, complete cds.//6.60E-95//182aa//100%//AF057356 
TESTI20219110//TYROSINE-PROTEIN KINASE-LIKE 7 PRECURSOR (COLON CARCINOMA KINASE-4) (CCK-4).// 
4.00E-23//53aa//1 00%//Q1 3308 
TESTI20219390 

20 TESTt20220230//Bos taurus Reissner's fiber glycoprotein I mRNA, partial cds.//1 .20E-1 0//77aa//50%//AF078930 
TESTI20221790 

TESTI20222030//Homo sapiens very long-chain acyl-CoA synthetase (BG1) mRNA, complete cds.//6.10E-172// 
643aa//50%//AF1 79481 

TESTI20222460//DYNEIN GAMMA CHAIN, FLAGELLAR OUTER ARM.//1 .50E-138//589aa//46%//Q39575 
25 TESTI20223380 

TESTI20226520//TU BU LI N-TYRO SINE LIGASE (EC 6.3.2.25) (7TL).//6.70E-06//164aa//30%//P38160 
TESTI20227380//DEAD BOX PROTEIN 4 (VASA HOMOLOG) (RVLG).//1 .10E-263//577aa//86%//Q64060 
TESTI20228120//RHO-GTPASE-ACTIVATING PROTEIN 6 (RHO-TYPE GTPASE- ACTIVATING PROTEIN RHOG- 
APX-1 ) (FRAGMENT) .//4.50E-1 2//1 64aa//34%//054834 
30 TESTI20228740//ZINC FINGER PROTEIN 1 35.//3.50E-25//1 32aa//43%//P52742 

TESTI20244220//MYOSIN HEAVY CHAIN IB (MYOSIN HEAVY CHAIN IL).//6.50E-11//77aa//42%//P19706 
TESTI20244430//ANKYRIN 1 (ERYTHROCYTE ANKYRIN) (ANKYRIN R) (ANKYRINS 2.1 AND 2.2).//3.70E-15// 
173aa//35%//P16157 

TESTI20244460//ADENYLATE KINASE, CHLOROPLAST (EC 2.7.4.3) (ATP-AMP TRANSPHOSPHORYLASE).// 
35 3.60E-34//209aa//37%//F431 88 

TESTI20244730//Mus musculus alpha/beta hydrolase-1 mRNA, complete cds.//8.20E-113//266aa//81%//AF1 89764 

TESTI20245600//HYPOTHETICAL 118.4 KDA PROTEIN IN BAT2-DAL5 INTERGENIC REGION PRECURSOR.// 

4.50E-05//236aa//27%//P471 79 

TESTI20245860 
40 TESTI20246410 

TESTI20246480//Homo sapiens germline specific RNA binding protein (DAZL1) mRNA, complete cds.//5.00E-22// 
86aa//55%//U66726 

TESTI20247440//Human BLu protein testis isoform (BLu) mRNA, complete cds.//5. 40 E-45//91aa//96%//U70880 
TESTI20248850 

45 TESTI20249360//Homo sapiens DEME-6 mRNA, partial cds.//4.70E-94//299aa//56%//AF0071 70 
TESTI20250220/n"RICH0HYALIN.//5.40E-54//537aa//30%//P37709 

TESTI20250630//Columba livia mRNA for 5'-nucleotidase.//3.80E-115//328aa//66%//AJ131243 
TESTI20251440//Rattus norvegicus (rsec6) mRNA, complete cds.//3.80E-31//379aa//2896//U32575 
TESTI20251 61 0//INTER-ALPHA-TRYPSIN INHIBITOR HEAVY CHAIN H2 PRECURSOR (ITI HEAVY CHAIN H2).// 
50 1.1 0E-07//1 82aa//26%//002668 

TESTI20251740//FYN-BINDING PROTEIN (SLP-76 ASSOCIATED PROTEIN) (SLAP-130).//6.30E-16//88aa//50%// 
015117 

TESTI20252690//SEGMENT POLARITY PROTEIN DISHEVELLED. HOMOLOG DVL-3 (DISHEVELLED-3) (DSH HO- 
MOLOG 3) (KIAA0208).//4.60E-137//304aa//85%//Q92997 
55 TESTI20254030//Homo sapiens actin-binding double-zinc-finger protein (abLIM) mRNA, complete cds.//6.70E-150// 
280aa//96%//AF005654 
TESTI20254090 
TESTI20254480 
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TESTI20254990//ZINC FINGER PROTEIN GLI3 (NEURAL SPECIFIC DNA BINDING PROTEIN XGLI3) (XGLI-3).// 
6.50E-46//1 05aa//75%//Q91 660 

TESTI20255460//Mus musculus mRNA for MIWI (piwi), complete cds.//1 .50E-225//864aa//49%//AB032604 
TESTI20256560//NUF1 PROTEIN (SPINDLE POLY BODY SPACER PROTEIN SPC110).//5.70E-05//590aa//19%// 
5 P32380 

TESTI20257910//HLA CLASS I HISTOCOMPATIBILITY ANTIGEN, ALPHA CHAIN G - PRECURSOR (HLA G ANTI- 
GEN).//1 .90E-1 22//223aa//1 00%//P1 7693 

TESTI20258720//ANKYRIN 1 (ERYTHROCYTE ANKYRIN) (ANKYRIN R) (ANKYRINS 2.1 AND 2.2).//1 .20E-1 0// 
233aa//27%//P16157 
10 TESTI20259110 
TESTI20259200 
TESTI20260140 

TESTI20260640//hellcase II homolog - fruit fly (Drosophlla sp.)//2.70E-27//374aa//27%//T13889 
TESTI20261040//DPY-19 PROTEINV/2.10E-47//316aa//34%//P34413 
15 TESTI20261160//Mus musculus rasGAP-activating-like protein mRNA, complete cds.//1 .60E-21//129aa//41%// 
AF086714 

TESTI20261680//HEAT SHOCK PROTEIN 30C.//1 .70E-08//136aa//27%//P30218 

TESTI20262150//Rattus norvegicus mRNA for voltage-gated ca channel, complete cds.//0//822aa//87%//AB0 18253 
TESTI20262940//TUBU LI N-TY ROSIN E LIGASE (EC 6.3.2.25) (TTL).//5.70E-1 0//218aa//25%//P38160 
20 TESTI20264530//PUTATIVE SERINE/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1.-.//5.60E-18//219aa//34%// 
P49695 

TESTI20264910 
TESTI20265150 
TESTI20265340 
25 TESTI20265890 

TESTI20266050//52 KDA RO PROTEIN (SJOGREN SYNDROME TYPE A ANTIGEN (SS-A)) (RO(SS-A)).//2.10E- 
77//472aa//38%//P 1 9474 

TESTI20268240//Homo sapiens membrane-associated nucleic acid binding protein mRNA, partial cds.//1 .90E-52// 
41 2aa//36%//AF255303 
30 TESTI20269250 
TESTI20269360 

TESTI20270130//FIBRILLARIN.//2.10E-11//97aa//43%//Q22053 
TESTI20271790//Human p76 mRNA, complete cds.//6.9e-317//496aa//99%//U81006 
TESTI20272380 

35 TESTI20274960//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//7.90E-126//342aa//63%//P51522 
TESTI20277300 

TESTI20278280//Mus musculus p53 apoptosis-associated target (Perp) mRNA, complete cds.//3.40E-84//155aa// 
88%//AF249870 

TESTI20282420//MUS musculus EPCS26 mRNA, complete cds 7/2.40 E-19//122aa//3396//AF250838 
40 TESTI20282530//ZINC FINGER PROTEIN 135.//1 .60E-56//271aa//39%//P52742 
TESTI20282900 

TESTI20284260//HISTONE H2B F (H2B 291A).//2.10E-22//120aa//43%//P10853 

TESTI20285230//DOUBLE-STRANDED RNA-SPECIFIC EDITASE 1 (EC 3.5.-.-) (DSRNA ADENOSINE DEAMI- 
NASE)(RNA EDITING ENZYME 1).//2.10E-20//192aa//38%//P51400 
45 TESTI20286590//Human type 3 inositol 1 ,4,5-trisphosphate receptor (ITPR3) mRNA, complete cds.//1 .00E-93// 
185aa//100%//U01062 
TESTI20287760 

THYMU10004280//ZINC-FINGER PROTEIN HT2A (72 KDA TAT- INTERACTING PROTEIN)7/7.90E-13//87aa//42%// 
Q13049 

50 THYMU20006020//ISOCITRATE DEHYDROGENASE [NAD] SUBUNIT ALPHA, MITOCHONDRIAL PRECURSOR 

(EC 1.1.1.41) (ISOCITRIC DEHYDROGENASE) (NAD+-SPECIFIC ICDH).//2.60E-120//229aa//99%//P50213 

THYMU20007020 

THYMU20007750 

THYMU20008000 
55 THYMU20009460 

THYMU20009500 

THYMU20009710 

THYMU20010180//MOB1 PROTEIN (MPS1 BINDER 1)7/2. 60 E-31//1 36aa//50%//P40484 
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THYMU20010710 
THYMU20012020 
THYMU20012560 

THYMU20013250//LIM DOMAIN KINASE 2 (EC 2.7.1.-) (LIMK-2)7/3.50E-213//354aa//99%//P53671 
5 THYMU20013810//Human SEC7 homolog Tic (TIC) mRNA, complete cds.//2.30E-8B//138aa//96%//U63127 

THYMU20014430 

THYMU20017270 

THYMU20018250 

THYMU20018390 
10 THYMU20019000 

THYMU20019260//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1)7/6.80E-49//137aa//68%// 
Q03923 

THYMU20020370 

THYMU20020800//LMBR1 long form [Mus musculus]7/3.00E-69//198aa//55%//AAF91092 
15 THYMU20021090//Homo sapiens Sex comb on midleg homolog 1 isoform 1 (SCMH1) mRNA, complete cds.//4.40E- 
80//1 49aa//63%//AF1 49045 
THYMU20021540 

THYMU20023560//DNA BINDING PROTEIN RFX27/6.70E-25//59aa//94%//P48378 

THYMU20024500//N EURO FILAMENT TRIPLET M PROTEIN (160 KDA NEUROFILAMENT PROTEIN) (NF-M).// 
20 6.20E-06//296aa//22%//P16053 
THYMU20025480 

THYMU20026950//Mus musculus ROSA 26 transcription AS ROSA26AS mRNA, complete cds.//2.20E-12//285aa// 
23%//U83176 

THYMU20028150//AXONIN-1 PRECURSOR (AXONAL GLYCOPROTEIN TAG-1) (TRANSIENT AXON AL GLYCO- 
LS PROTEIN 1 ).//2.20E-33//301 aa//31 %//Q02246 

THYMU2002841 0//Mus musculus Pax transcription activation domain interacting protein PTIP mRNA, complete cds.// 

1 .70E-144//345aa//81%//AF1 04261 

THYMU20029830 

THYMU20030460//Homo sapiens tumor endothelial marker 7 precursor (TEM7) mRNA, complete cds7/2.10E-123// 
30 230aa//99%//AF279144 
THYMU20030690 

THYMU20031330//Homo sapiens putative nucleotide binding protein mRNA, complete cds.//1.00E-18//64aa//82%// 
AF1 18394 

THYMU20032820//ZINC FINGER PROTEIN 1357/1 .40 E-82//258aa//55%//P52742 
35 THYMU20034400//26S proteasome subunit p44.5 [Homo sapiens]//8.80E-29//71aa//91%//AB0031 02 
THYMU20034790 
THYMU20036500 

THYMU20039320//PUTATIVE SERINE/THREONINE-PROTEIN KINASE PKWA (EC 2.7.1 .-) 7/1 .40 E-09//206aa// 

27%//P49695 
40 THYMU20043440 

THYMU20043560 

THYMU20044100 

THYMU20044520 

THYMU20046350 
45 THYMU20046770 

THYMU20049060//SPLICING FACTOR, ARGININE/SERINE-RICH 2 (SPLICING FACTOR SC35) (SC-35) (SPLICING 

COMPONENT, 35 KDA) (PR264 PROTE!N)7/6.00E-41//11 9aa//76%//P30352 

THYMU20050010 

THYMU20051340 

50 THYMU20052460//PHORBOLIN I (FRAGMENTS).//5.80E-20//111aa//45%//P31941 

THYMU20052830//Homo sapiens mRNA for immunoglobulin lambda heavy chainV/1 . 50 E-237//477aa//90%//Y 14737 

THYMU20054800 

THYMU20055450 

THYMU20055460//ESTERASE D (EC 3. 1. 1. 1)7/7. 60 E-57//107aa//100%//P 10768 
55 THYMU20055740 

THYMU20055760//Mus musculus group IIF secreted phospholipase A2 (Pla2g2f) mRNA, complete cds.//1 .70E-73// 

1 65aa//75%//AF1 66099 

THYMU20058550 
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THYMU20060480 
THYMU20062520 

THYMU20062610//DYNEIN GAMMA CHAIN, FLAGELLAR OUTER ARM.//1.70E-156//585aa//50%//Q39575 
THYMU20062770//UROMODULIN PRECURSOR (TAMM-HORSFALL URINARY GLYCOPROTEIN) (THP).//4.20E- 
5 22//253aa//28%//P27590 

THYMU20063650//Homo sapiens mRNA for putative ribulose-5-phosphate-epimerase, partial cds.//9.30E-57//1 1 6aa// 

97%//AJ224326 

THYMU20064680 

THYMU20066660//Homo sapiens putative ATP-dependent RNA helicase ROK1 mRNA, complete cds.//9.20E-153// 
10 361aa//85%//AF077033 
THYMU20069130 
THYMU20069460 
THYMU20069650 

THYMU20070250//TRANSKETOLASE (EC 2.2.1.1) (TK).//4.50E-288//513aa//96%//P29401 
15 THYMU20071120//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//1 .40E-229//536aa//74%// 
Q03923 

THYMU20071460 
THYMU20072580 
THYMU20073070 
20 THYMU20073080 

THYMU20077250//T-CELL-SPECIFIC TRANSCRIPTION FACTOR 1 (TCF-1) (T-CELL FACTOR 1) (TRANSCRIP- 
TION FACTOR-7).//4.1 0E-90//1 76aa//96%//Q0041 7 
THYMU20078020 
THYMU20078240 
25 THYMU20079690 
THYMU20080490 

THYMU20081110//PINCH PROTEIN (PARTICULARY INTERESTING NEW CYS-HIS PROTEIN).//4.00E-28//60aa// 
86%//P48059 
THYMU20083390 
30 THYMU20083500 

THYMU20083830//Homo sapiens angiostatin binding protein 1 mRNA, complete cds.//2.30E-09//230aa//28%// 

AF286598 

THYMU20084520 

THYMU20086430 

35 THYMU20087270//POTENTIAL PHOSPHOLIPID-TRANSPORTING ATPASE MB (EC 3.6.1 .-.//7.20E-154//235aa// 
88%//P98195 
THYMU20089170 
THYMU20089900 

THYMU20090230//Homo sapiens ribonucleoprotein mRNA, complete cds.//9.30E-73//133aa//100%//L32610 
40 THYMU20091040 

THYMU20095920//Homo sapiens nuclear prelamin A recognition factor mRNA, complete cds.//2.90E-94//178aa// 
100%//AF1 28406 

TH YM U20096580//S E RYL-TR N A SYNTHETASE (EC 6.1.1.11) (SERINE-TRNA LIGASE) (SEARS).//9.90E-20// 
45aa//97%//P49591 

« THYMU20097920//mitogen inducible gene mig-2 - human//2.50E-1 97//477aa//55%//S69890 

THYMU20098350//KERATIN, TYPE II CYTOSKELETAL 5 (CYTOKERATIN 5) (K5) (CK 5) (58 KDA CYTOKERATIN).// 
3.40E-267//577aa//89%//P1 3647 

THYMU20099060//IG ALPHA- 1 CHAIN C REGION.//1 .70E-1 96//353aa//99%//P01 876 

THYMU20100940//POTENTIAL PHOSPHOLIPID-TRANSPORTING ATPASE IR (EC 3.6.1.-) (FRAG ME NT) 7/0// 
50 646aa//99%//Q9Y2G3 

THYMU20104480//TRICHOHYALIN7/2.90E-21//300aa//28%//P37709 

THYMU20106990//Mus musculus evectin-2 (Evt2) mRNA, complete cds7/2.20E-112//222aa//90%//AF189817 
THYMU20110720//PROBABLE AMINOTRANSFERASE T01B11.2 (EC 2.6.1 .-).//6.90E-40//211aa//41%//P91408 
THYMU201 12570 

55 THYMU20112590//Homo sapiens AP-4 adaptor complex beta4 subunit mRNA, complete cds.//3.40E-306//534aa// 
99%//AF092094 
THYMU201 15380 
THYMU201 15730 
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THYMU20117850 

THYMU20120240//Arabidopsis thaliana ubiquitin-specific protease (AtUBP4) mRNA, complete cds.//4.10E-18// 
179aa//28%//U76846 

THYMU201 20730//ALDEHYDE DEHYDROGENASE, DIMERIC NADP-PREFERRING (EC 1.2.1.5) (ALDH CLASS 
5 3)7/2.1 0E-203//379aa//99%//P30838 

THYMU20121040//ELONGATION FACTOR 1 -DELTA (EF-1-DELTA).//5.30E-149//281aa//99%//P29692 

THYMU201 28910 

THYMU201 29020 

THYMU201 30470 
10 THYMU201 34260 

THYMU201 37050 

THYMU201 37570 

THYMU20139160 

THYMU201 40510 

15 THYMU20143230//Homo sapiens mRNA for stabilin-1 (stabl gene).//1.70E-177//317aa//99%//AJ275213 
THYMU20145990//nesca protein [Homo sapiens].//1.90E-152//282aa//98%//NP_055143 
THYMU201 48010 
THYMU20 149230 
THYMU20150190 

20 THYMU20151610//Homo sapiens antigen NY-CO-1 (NY-CO-1) mRNA, complete cds.//1.40E-181//344aa//100%// 
AF039687 

THYMU20153210//Homo sapiens Diff33 protein homolog mRNA, complete cds.//4.00E-120//404aa//54%//AF1 64794 
THYMU 20 154790 
THYMU201 57620 
25 THYMU201 63600 

THYMU20170080//Homo sapiens SIT protein 7/9. 5 0E-48%/78aa//98%//AJ01 0059 

THYMU20170230//Homo sapiens sarcosine dehydrogenase (SARDH) mRNA, complete cds7/1.80E-183//260aa// 
99%//AF095735 
THYMU201 71580 
30 THYMU201 74490 
THYMU201 74790 
THYMU201 75260 

THYMU20176010//VEGETATIBLE INCOMPATIBILITY PROTEIN HET-E-1 7/1 .80E-20//256aa//28%//Q00808 
THYMU201 77070 

35 THYMU20178440//Homo sapiens mRNA for immunoglobulin lambda heavy chain7/2.20E-229//479aa//88%//Y14737 
THYMU20181890 
THYMU201 84550 
THYMU201 85470 

THYMU20185650//DIAPHANOUS PROTEIN HOMOLOG 17/1 .30E-20//85aa//44%//06061 0 
40 THYMU201 87210 

THYMU201 91 970//Homo sapiens FLAMINGO 1 mRNA, partial cds7/1 .80E-54//450aa//36%//AF234887 
TKIDN10000620//Homo sapiens Tax interaction protein 2 mRNA, partial cds.//1 .20E-56//114aa//100%//AF028824 
TKIDN1 0001 710 

TKIDN10001920//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) ( HTF 1)7/3. 20 E-97//226aa//73%// 
45 Q03923 

TR ACH200 1101 0//G LUCO AM YLAS E S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//4.30E-17//593aa//22%//P08640 
TRACH2001 1 540//TUMOR-ASSOCIATED ANTIGEN L6.//4.70E-57//113aa//97%//P30408 
TRACH20012490 
50 TRACH20021000 

TRACH2002 1 380//AD ENYLATE CYCLASE, TYPE V (EC 4.6.1.1) (ATP PYROPHOSPHATE-LYASE) (CA(2-»-)-INHIB- 
ITABLE ADENYLYL CYCLASE) 7/1 .20E-276//492aa//95%//P30803 
TRACH20025370 
TRACH20026640 
55 TRACH20029880 

TRACH20040390//MATERNAL PUMILIO PROTEIN7/1 .20E-177//812aa//47%//P25822 
TRACH20041090 

TRACH20043360//PUTATIVE KINESIN-LIKE PROTEIN C2F12.137/2.30E-55//333aa//39%//014343 



217 



EP 1 308 459 A2 



TRACH20044990 
TRACH20049500 
TRACH20051590 
TRACH20057200 
5 TRACH20058000 
TRACH20073990 
TRACH20080810 
TRACH20081270 

TR ACH20090060//S YN APTOTAG M I N IV7/1 .1 0E-12//301 aa//25%//P40749 
10 TRACH2009 1 070//ALD EHYDE DEHYDROGENASE, DIMERIC NADP-PREFERRING (EC 1.2.1.5) (ALDH CLASS 
3).//1.00E-173//328aa//98%//P30838 , 
TRACH20093400//TRICHOHYALIN7/2.30E-1 3//701 aa//20%//P37709 
TRACH20093480 

TRACH20098510//MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSCLE.//1 . 1 0E-20//640aa//21 %//P1 0587 
15 TRACH20101590 
TRACH201 04510 

TRACH20108240//ribonucleoprotein - African clawed frog//4.10E-118//223aa//96%//S40774 

TRACH20113020//SELENIDE, WATER DIKINASE 2 (EC 2.7.9.3) (SELENOPHOSPHATE SYNTHETASE 2) (SELE- 
NIUM DONOR PROTEIN 2).//1 .80E-207//364aa//96%//Q99611 
20 TRACH201 22980//HYPOTH ETICAL PROTEIN MJ0798V/6.80E-13//211aa//22%//Q58208 
TRACH201 23870 
TRACH201 24970 
TRACH201 25620 
TRACH20129180 

25 TRACH20131230//Homo sapiens oxysterol binding protein-related protein 3 (ORP3) mRNA, complete cds.//7.50E- 
282//608aa//62%//AY008372 
TRACH201 39280 
TRACH20140180 

TRACH201 4371 0//RAB GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT (EC 2.5. 1 .-) (RAB GERANYL-GER- 
30 ANYLTRANSFERASE ALPHA SUBUNIT) (RAB GG TRANSFERASE) (RAB GGTASE).//9.70E-07//142aa//33%// 
Q92696 

TRACH20149500//KERATIN, TYPE I CUTICULAR HA6 (HAIR KERATIN, TYPE I HA6).//1 .10E-62//215aa//60%// 
076013 

TRACH201 49720 

35 TR ACH201 49740//EXCITATO RY AMINO ACID TRANSPORTER 5 (RETINAL GLUTAMATE TRANSPORTER).// 
5.00E-76//1 52aa//98%//000341 
TRACH201 58240 
TRACH201 59390 
TRACH201 60800 

40 TRACH20163470//Mus musculus putative thymic stromal co-transporter TSCOT mRNA, complete cds.//3.20E-41// 
187aa//34%//AF148145 

TRACH20164100//RETROVIRUS-RELATED PROTEASE (EC 3.4.23.-)7/1 .40E-32//113aa//56%//P 10265 
TRACH201 64810 
TRACH201 65330 

45 TRACH20165540//Human alpha-1 type I collagen gene surrounding osteogenesis imperfecta Ol type II deletion.// 
4.00E-05//1 02aa//37%//M1 11 62 
TRACH201 67090 

TRACH20170860//IG DELTA CHAIN C REGION.//1 .60E-212//383aa//100%//P01880 
TRACH20173680//Homo sapiens mRNA for LAK-4p, complete cds.//3.50E-80//410aa//38%//AB002405 
50 TRACH201 74980 
TRACH201 82780 
TRACH20185120 

TRACH20188350//Homo sapiens mRNA forcentaurin beta27/3.80E-60//204aa//56%//AJ238248 
TRACH20190460//MONO- AND DIACYLGLYCEROL LIPASE PRECURSOR (EC 3.1 .1 .-) (MDGL).//2.60E-11//195aa// 
55 28%//P25234 

UMVEN1 0001 380 

UTERU10001 060//ETS-DOMAIN PROTEIN ELK-1 .//1 .40E-39//88aa//93%//P19419 
UTERU1 0001 870 
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UTERU20000230 

UTERU20000950//Homo sapiens PC326 protein (PC326) mRNA, complete cds7/2.80E-55//112aa//100%//AF1 50734 

UTERU20011760 

UTERU20013890 

5 UTERU20016580//POTENTIAL TRANSCRIPTIONAL ADAPTOR 7/1 .90E-37//323aa//31%//Q02336 

UTERU20026620//ZINC FINGER PROTEIN 757/7.50E-82//174aa//82%//P51815 

UTERU20027360 

UTERU20029930 

UTERU20031350 
10 UTERU20035770 

UTERU20040150 

UTERU20040370 

UTERU20040390 

UTERU20040730 

15 UTERU20041630//ZINC FINGER PROTEIN 184 (FRAGMENT).//3.30E-75//239aa//44%//Q99676 
UTERU20041970 
UTERU20045200 

UTERU20051790//guanytate kinase-interacting protein 1 Maguin-1, membrane -associated - rat//8.20E-26//267aa// 
33%//T18293 

20 UTERU20064120//MYELOID UPREGULATED PROTEIN.//1 .30E-79//245aa//66%//035682 
UTERU20065470 
UTERU20079240 
UTERU20083020 

UTERU20086530//GLYCODELIN PRECURSOR (GD) (PREGNANCY-ASSOCIATED ENDOMETRIAL ALPHA-2 
25 GLOBULIN) (PEG) (PAEG) (PLACENTAL PROTEIN 14) (PROGESTERONE-ASSOCIATED ENDOMETRIAL PRO- 
TEIN) (PROGESTAGEN-ASSOCIATED ENDOMETRIAL PROTEIN) 7/6. 00 E-73//140aa//99%//P09466 
UTERU20087070//COMPLEMENT C1R COMPONENT PRECURSOR (EC 3.4.21 .41)7/4. 80 E-206//360aa//99%// 
P00736 

UTERU20087850//Homo sapiens mRNA for mucolipidin (ML4 gene)7/2.70E-34//184aa//42%//AJ293970 
30 UTERU20089300 

UTERU20089390//Plectonema boryanum kinesin light chain (KLC) gene, complete cds.//3.30E-38//216aa//43%// 
U 78597 

UTERU20089620//Homo sapiens radical fringe (RFNG) gene, partial cds7/1 .50E-31//65aa//100%//AF108139 
UTERU20090940 
35 UTERU20091470 

UTERU20094830//SINGLE-MINDED HOMOLOG 2 (SIM TRANSCRIPTION FACTOR) (MSIM)7/8.70E-09//427aa// 

25%//Q61079 

UTERU20095100 

UTERU20099040//ZINC TRANSPORTER 2 (ZNT-2).//9.80E-96//242aa//76%//Q62941 
40 UTERU20099510//ZINC FINGER PROTEIN 1 357/2. 20E-107//346aa//54%//P52742 

UTERU201 41 1 50//NUCLEAR FACTOR NF-KAPPA-B P49 SUBUNIT7/2.20E-07//76aa//47%//Q04860 
UTERU201 02260 
UTERU201 03040 
UTERU201 03200 

« UTERU20104310//DNA-DIRECTED RNA POLYMERASE II 14.4 KDA POLYPEPTIDE (EC 2.7.7.6) (RPB6) 
(RPB1 4.4)7/1 .80E-48//97aa//100%//P41584 
UTERU201 06510 
UTERU20121140 

UTERU20122520//MOESIN (MEMBRANE-ORGANIZING EXTENSION SPIKE PROTEIN)7/6.00E-37//399aa//28%// 
50 P26042 

UTERU201 25810 

UTERU20127030//LAMININ BETA-2 CHAIN PRECURSOR (S-LAMININ).//6.80E-175//377aa//92%//P55268 
UTERU20127150 

UTERU20128560//26.4 KDA PROTEIN IN RUVC-ASPS INTERGENIC REGION7/2.60E-17//120aa//34%//P24237 
55 UTERU20132620//AXONEME-ASSOCIATED PROTEIN MST1 01 (2)7/1 .40E-15//231 aa//31%//Q08696 

UTERU20134830//pellino (Drosophila) homolog 2 [Homo sapiens] //1 .40E-153//361aa//729o//NP_067078 
UTERU20139760//solute carrier family 25 (mitochondrial carrier; peroxisomal membrane protein, 34kD), member 17 
[Homo sapiens]//5.40E-1 00//203aa//97%//XP_001 1 36 
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UTERU201 40010 
UTERU201 67570 

UTERU201 68960//Homo sapiens actin filament associated protein (AFAP) mRNA, complete cds.//2.60E-68//364aa// 
43%//AF1 88700 

5 UTERU20169020//HOMEOBOX PROSPERO-LIKE PROTEIN PROX1 (PROX 1)7/1 .30E-54//117aa//74%//Q91018 
UTERU201 73030 
UTERU201 76230 

UTERU20177150//Homo Sapiens zinc finger protein dp mRNA, complete cds.//4.60E-10//104aa//40%//AF1 53201 
UTERU20181270 

io UTERU20185220//Human mRNA for transcriptional activator hSNF2a, complete cds.//1.60E-125//246aa//98%// 
D26155 

UTERU20188670//HFM1 PROTEIN.//5.10E-19//234aa//26%//P51979 
UTERU201 88840 

15 

Claims 

1. A polynucleotide selected from the group consisting of the following (a) to (g) : 

20 (a) a polynucleotide comprising a protein-coding region of the nucleotide sequence of any one of SEQ ID NOs: 

1 to 1970; 

(b) a polynucleotide encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 
1971 to 3940; 

(c) a polynucleotide comprising a nucleotide sequence encoding a polypeptide comprising the amino acid 
25 sequence of any one of SEQ ID NOs; 1 971 to 3940, wherein, in said amino acid sequence, one or more amino 

acids have been substituted, deleted, inserted, and/or added, and wherein said nucleotide sequence encodes 
a polypeptide functionally equivalent to a polypeptide comprising the selected amino acid sequence; 

(d) a polynucleotide hybridizing to a polynucleotide comprising the nucleotide sequence of any one of SEQ 
ID NOs: 1 to 1970, wherein said nucleotide sequence encodes a polypeptide functionally equivalent to a 

30 polypeptide encoded by the selected nucleotide sequence; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a polypep- 
tide encoded by the polynucleotide according to any one of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence having at least 70% identity to the nucleotide sequence 
of any one of SEQ ID NOs: 1 to 1970; and 

3$ (g) a polynucleotide comprising a nucleotide sequence having at least 90% identity to the nucleotide sequence 

of any one of SEQ ID NOs: 1 to 1970. 

2. A polypeptide encoded by the polynucleotide of claim 1 , or a partial peptide thereof. 
40 3. An antibody binding to the polypeptide or the peptide of claim 2. 

4. A method for immunologically assaying the polypeptide or the peptide of claim 2, said method comprising the steps 
of contacting the polypeptide or the peptide of claim 2 with the antibody of claim 3, and observing the binding 
between the two. 

45 

5. A vector comprising the polynucleotide of claim 1 . 

6. A transformant carrying the polynucleotide of claim 1 or the vector of claim 5. 

50 7. A transformant carrying the polynucleotide of claim 1 or the vector of claim 5 in an expressible manner. 

8. A method for producing the polypeptide or the peptide of claim 2, said method comprising the steps of culturing 
the transformant of claim 7 and recovering an expression product. 

55 9. An oligonucleotide comprising at least 15 nucleotides, said oligonucleotide comprising a nucleotide sequence 
complementary to the nucleotide sequence of any one of SEQ ID NOs: 1 to 1970 or to a complementary strand 
thereof. 



220 



EP 1 308 459 A2 

10. Use of the oligonucleotide of claim 9 as a primer for synthesizing the polynucleotide of claim 1 . 

11. Use of the oligonucleotide of claim 9 as a probe for detecting the polynucleotide of claim 1 . 

12. An antisense polynucleotide against the polynucleotide of claim 1 or a part thereof. 

13. A method for detecting the polynucleotide of claim 1 , said method comprising the following steps of: 

a) incubating a target polynucleotide with the oligonucleotide of claim 9 under hybridizable conditions, and 

b) detecting hybridization of the target polynucleotide with the oligonucleotide of claim 9. 

14. A database of polynucleotides and/or polypeptides, said database comprising information on at least one of the 
nucleotide sequences of SEQ ID NOs: 1 to 1970 and/or on at least one of the amino acid sequences of SEQ ID 
NOs: 1971 to 3940. 
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Figure 1 
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